SINGLE-CHAIN FORMS OF THE GLYCOPROTEIN HORMONE QUARTET

Cross-Reference to Related Applications

This application is a continuation-in-part of U.S. Serial No. 08/853,524 filed 
9 May 1997 which is a continuation-in-part of U.S. Serial No. 08/351,591 filed 
7 December 1994 which is a continuation-in-part of U.S. Serial No. 08/334,628 filed 4 
November 1994 which is a continuation-in-part of U.S. Serial No. 08/310,590 filed 
22 September 1994 which is a continuation-in-part of U.S. Serial No. 08/289,396 filed 
12 August 1994. This application is also a continuation-in-part of U.S. Serial No. 
08/199,382 filed 18 February 1994. The disclosures of the above-mentioned applications 
are incorporated herein by reference. 

Acknowledgment of Government Support

This invention was made with government support under NIH Contract No. N01-HD-9-2922,
awarded by the National Institutes of Health. The government has certain
rights in this invention. 

Technical Field 

The invention relates to the field of protein engineering and the glycoprotein 
hormones which occur normally as heterodimers. More specifically, the invention 
concerns single-chain forms of chorionic gonadotropin (CG), thyroid stimulating 
hormone (TSH), luteinizing hormone (LH), and follicle stimulating hormone (FSH). 

Background Art 

In humans, four important glycoprotein hormone heterodimers (LH, FSH, TSH
and CG) have identical α subunits and differing β subunits. Three of these hormones

PCT application WO90/09800, published 7 September 1990, and incorporated
herein by reference, describes a number of modified forms of these hormones. One 
important modification is C-terminal extension of the β subunit by the carboxy terminal
peptide (CTP) of human chorionic gonadotropin or a variant thereof. Other muteins of these
hormones are also described. The relevant positions for the CTP are from any one of
positions 112-118 to position 145 of the β subunit of human chorionic gonadotropin. The
PCT application describes variants of the CTP extension obtained by conservative amino
acid substitutions such that the capacity of the CTP to alter the clearance characteristics is
not destroyed. In addition, U.S. Serial No. 08/049,869 filed 20 April 1993, incorporated
herein by reference, describes modifying these hormones by extension or insertion of the
CTP at locations other than the C-terminus and CTP fragments shorter than the sequence
extending from positions 112-118 to 145.

The CTP-extended β subunit of FSH is also described in two papers by applicants
herein: LaPolt, P.S. et al., Endocrinology (1992) 131:2514-2520 and Fares, F.A. et al.,
Proc Natl Acad Sci USA (1992) 89:4304-4308. Both of these papers are incorporated 
herein by reference. 

The crystal structure of the heterodimeric form of human chorionic gonadotropin 
has now been published in more or less contemporaneous articles; one by Lapthorn, A.J. 
et al Nature (1994) 369:455-461 and the other by Wu, H. et al Structure (1994) 2:545- 
558. The results of these articles are summarized by Patel, D.J. Nature (1994) 369:438- 
439. 

At least one instance of preparing a successful single-chain form of a heterodimer
is now known. The naturally occurring sweetener protein, monellin, is isolated from 
serendipity berries in a heterodimeric form. Studies on the crystal structure of the 
heterodimer were consistent with the proposition that the C-terminus of the B chain could 
be linked to the N-terminus of the A chain through a linker which preserved the spatial 
characteristics of the heterodimeric form. Such a linkage is desirable because, for use
as a sweetener protein, it would be advantageous to provide this molecule in a form stable
at high temperatures. This was successfully achieved by preparing the single-chain form,
thus impeding heat denaturation, as described in U.S. patent 5,264,558.

PCT application WO91/16922 published 14 November 1991 describes a
multiplicity of chimeric and otherwise modified forms of the heterodimeric glycoprotein
hormones. In general, the disclosure is focused on chimeras of α subunits or β subunits
involving portions of various α or β chains respectively. One construct simply listed in
this application, and not otherwise described, fuses substantially all of the β chain of
human chorionic gonadotropin to the α subunit preprotein, i.e., including the secretory
signal sequence for this subunit. This construct falls outside the scope of the present
invention since the presence of the signal sequence intervening between the β and α chains
fails to serve as a linker moiety as defined and described herein.

It has now been found that the normally heterodimeric glycoprotein hormones 
retain their properties when in single-chain form, including single-chain forms that contain 
the various CTP extensions and insertions described above.

Disclosure of the Invention 

The invention provides single-chain forms of the glycoprotein hormones, at least
some of which hormones are found in most vertebrate species. The single-chain forms of
the invention may be glycosylated, partially glycosylated, or nonglycosylated and the
α and β chains that occur in the native glycoprotein hormones or variants of them may
optionally be linked through a linker moiety. Particularly preferred linker moieties include
the carboxy terminal peptide (CTP) unit either as a complete unit or only as a portion
thereof, as well as shorter linkers of 1-16 amino acids. The resulting single-chain
hormones either retain the activity of the unmodified heterodimeric form or are
antagonists of this activity.

Thus, in one aspect, the invention is directed to a glycosylated or nonglycosylated
protein which comprises the amino acid sequence of the α subunit common to the
glycoprotein hormones linked covalently, optionally through a linker moiety, to the amino
acid sequence of the β subunit of one of said hormones, or variants of said amino acid
sequences wherein said variants are defined herein.

Because the single-chain form preserves conformation, the entire
subunits that make up the single-chain forms are not necessarily required. Thus, the
invention includes single-chain forms of fragments of the subunits wherein the single-chain
forms retain the biological activity exhibited by the single-chain forms of the complete
subunits.

In other aspects, the invention is directed to recombinant materials and methods to
produce the single-chain proteins of the invention; to pharmaceutical compositions
containing them; to antibodies specific for them; and to methods for their use.

Brief Description of the Drawings 

Figure 1 shows the construction of a SalI bounded DNA fragment fusing the third
exon of CGβ with the second exon encoding the α subunit.

Figure 2 shows the amino acid sequence and numbering of positions 112-145 of
human CGβ.

Figure 3 shows the results of a competition binding assay for FSH receptor by 
various FSH analogs. 

Figure 4 shows the results of a signal transduction assay with respect to FSH
receptor for various FSH analogs.

Figures 5-12 illustrate the coding sequences for single-chain gonadotropin analogs 
1-8 and relevant primers (underlined). 

Figures 13-14 illustrate the coding sequences for single-chain gonadotropin
analogs 9-10 and their cassettes (underlined).
Figure 15 shows the preparation of an α subunit encoding region lacking
oligosaccharide binding sites.

Figure 16 shows the preparation of a β subunit encoding region lacking N-linked
oligosaccharide binding sites.

Figure 17 shows the sequence encoding a single-chain gonadotropin analog
No. 1a.

Modes of Carrying Out the Invention 
Four "glycoprotein" hormones in humans provide a family which includes human
chorionic gonadotropin (hCG), follicle stimulating hormone (FSH), luteinizing hormone
(LH), and thyroid stimulating hormone (TSH). As used herein, "glycoprotein hormones"
refers to the members of this family. All of these hormones are heterodimers comprised of
α subunits which, for a given species, are identical in amino acid sequence among the
group, and β subunits which differ according to the member of the family. Thus, normally
these glycoprotein hormones occur as heterodimers composed of α and β subunits
associated with each other but not covalently linked. Most vertebrates produce FSH,
TSH and LH; chorionic gonadotropin has been found only in primates, including humans,
and horses.

Thus, this hormone "quartet" is composed of heterodimers wherein the α and β
subunits of each are encoded in different genes and are separately synthesized by the host.
The host then assembles the separately synthesized subunits into a non-covalently linked
heterodimeric complex. In this manner, the heterodimers of this hormone quartet differ
from heterodimers such as insulin which is synthesized from a single gene (in this case
with an intervening "pro" sequence) and the subunits are covalently coupled using
disulfide linkages. This hormone quartet is also distinct from the immunoglobulins which
are assembled from different loci, but are covalently bound through disulfide linkages. On
the other hand, monellin, which is, however, a plant protein, is held together through
noncovalent interaction between its A and B chains. It is not known at present whether
the two chains are encoded on separate genes.

Thus, a variety of factors is influential in determining the behavior of biologically
active compounds which are dimers formed from subunits that are identical or different.
The subunits may be covalently or noncovalently linked; they may be synthesized by the
same or different genes; and they may or may not contain, in their precursor forms, a
"pro" sequence linking the two members of the dimer. Based on the results obtained with
the single-chain forms of the glycoprotein hormone quartet herein, it is apparent that
single-chain forms of the biologically active dimers interleukin-12, interleukin-3 (IL-12
and IL-3), inhibin, tumor necrosis factor (TNF), and transforming growth factor (TGF)
will also be biologically active.

The single-chain forms of the heterodimers or homodimers have a number of
advantages over their dimeric forms. First, they are generally more stable. LH, in
particular, is noted for its instability and short half-life. Second, problems of recombinant
production are reduced since only a single gene need be transcribed, translated and
processed. This is particularly important for expression in bacteria. Third, of course, they
provide an alternate form thus permitting fine tuning of activity levels and of in vivo
half-lives. Finally, single-chain forms are unique starting materials for identifying truncated
forms with the activity of the dimer. The linkage between the subunits permits the protein
to be engineered without disturbing the overall folding of the protein.

With respect to this last point, it will be evident that because the conformation is
stabilized in the single-chain forms, less than the complete single-chain conjugate of the
subunits that compose it will generally be needed. Therefore, the invention covers
fragments of the single-chain proteins that retain biological activity; these fragments may
be visualized as single-chain forms obtained from fragments of the subunits per se.

Features of the Members of the Quartet 

The β subunit of hCG is substantially larger than the other β subunits in that it
contains approximately 34 additional amino acids at the C-terminus referred to herein as
the carboxy terminal portion (CTP) which, when glycosylated at the O-linked sites, is
considered responsible for the comparatively longer serum half-life of hCG as compared to
other gonadotropins (Matzuk, M. et al., Endocrinol (1989) 126:376). In the native
hormone, this CTP extension contains four mucin-like O-linked oligosaccharides.

In one embodiment of the present invention, the α and β chains of the glycoprotein
hormones are coupled into a single-chain proteinaceous material where the α and β chains
are covalently linked, optionally through a linker moiety. The linker moiety may include
further amino acid sequence, and in particular the CTP units described herein can be
advantageously included in the linker. In addition, the linker may include peptide or
nonpeptide drugs which can be targeted to the receptors for the hormones.

In addition to the head-to-tail configuration that is achievable by simply coupling
the two peptide chains through a peptide bond, the α and β chains can be linked head-to-head
or tail-to-tail. Head-to-head and tail-to-tail couplings involve synthetic chemistry
using standard techniques to link two carboxyl or two amino groups through a linker
moiety. For example, two amino groups may be linked through an anhydride or through
any dicarboxylic acid derivative; two carboxyl groups can be linked through diamines or
diols using standard activation techniques. However, the most preferred form is a head-to-tail
configuration wherein standard peptide linkages suffice and the single-chain compound
can be prepared as a fusion protein recombinantly or using synthetic peptide techniques
either in a single chain or, preferably, ligating individual portions of the entire sequence.
Of course, if desired, peptide or non-peptide linker moieties can be used in this case as
well, but this is unnecessary and the convenience of recombinant production of the
single-chain protein would suggest that embodiments that permit this method of production
comprise by far the most preferred approach.

When a head-to-tail configuration is employed, linkers may consist essentially of
additional peptide sequence. As is the case with the heterodimers, the two β chains may
be linked through a CTP unit as further described below. Thus, possible embodiments of
the invention include, with the N-terminus at the left, α-FSHβ, βFSH-α, α-βLH,
α-CTP-βLH, βLH-CTP-α, CTP-βLH-CTP-α, and the like.

The following definitions may be helpful in describing the single-chain forms of the
molecules.

As used herein, α subunit, and FSH, LH, TSH, and CG β subunits as well as the
heterodimeric forms have in general their conventional definitions and refer to the proteins
having the amino acid sequences known in the art per se, or allelic variants thereof,
regardless of the glycosylation pattern exhibited.

"Native" forms of these peptides are those which have the amino acid sequences
isolated from the relevant vertebrate tissue, and have these known sequences per se, or
their allelic variants.

"Variant" forms of these proteins are those which have deliberate alterations in 
amino acid sequence of the native protein produced by, for example, site-specific
mutagenesis or by other recombinant manipulations, or which are prepared synthetically.

These alterations consist of 1-10, preferably 1-8, and more preferably 1-5 amino
acid changes, including deletions, insertions, and substitutions, most preferably
conservative amino acid substitutions as defined below. The resulting variants must retain
activity which affects the corresponding activity of the native hormone, i.e., either they
must retain the biological activity of the native hormone directly, or they must behave as
antagonists, generally by virtue of being able to bind the receptors for the native hormones
but lacking the ability to effect signal transduction. For example, it is known that if the
glycosylation site at position 52 of the α subunit is removed by an amino acid substitution,
therefore preventing all glycosylation at that site, the hormones which are heterodimers
with this altered α subunit are generally antagonists and are able to bind receptors, preventing
the native hormone from doing so in competition. (On the other hand, the glycosylation
site of the α subunit at position 78 appears not greatly to affect the activity of the
hormones.) Other alterations in the amino acid sequence may also result in antagonist
rather than agonist activity for the variant.
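
The numeric limits on alterations stated above can be checked mechanically. The following Python sketch is illustrative only and is not part of the original disclosure. It counts the minimum number of substitutions, insertions and deletions (the Levenshtein distance) between a native and a variant sequence and reports which of the stated ranges the count falls in. The function names and the short example sequences are hypothetical. The sketch does not assess whether agonist or antagonist activity is retained, which must be determined by assay.

```python
def edit_distance(native: str, variant: str) -> int:
    """Minimum number of substitutions, insertions and deletions (Levenshtein)."""
    prev = list(range(len(variant) + 1))
    for i, a in enumerate(native, start=1):
        curr = [i]
        for j, b in enumerate(variant, start=1):
            cost = 0 if a == b else 1
            curr.append(min(prev[j] + 1,          # delete a residue of the native
                            curr[j - 1] + 1,      # insert a residue
                            prev[j - 1] + cost))  # substitute (or match)
        prev = curr
    return prev[-1]


def alteration_range(native: str, variant: str) -> str:
    """Classify a variant by the ranges given in the text (1-5, 1-8, 1-10)."""
    n = edit_distance(native, variant)
    if n == 0:
        return "native"
    if n <= 5:
        return "1-5"
    if n <= 8:
        return "1-8"
    if n <= 10:
        return "1-10"
    return "outside 1-10"


if __name__ == "__main__":
    # Hypothetical toy sequences, not hormone subunit sequences.
    print(alteration_range("ACDEFGHIKL", "ACDEYGHIKL"))
```

Larger deletions in noncritical loops, which the specification permits for the single-chain forms, would fall outside these ranges and would need to be handled separately.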

One set of preferred variants are those wherein the glycosylation sites of either the
α or β subunits or both have been altered. The α subunit contains two glycosylation sites,
one at position 52 and the other at position 78, and the effect of alterations of these sites
on activity has just been described. Similarly, the β subunits generally contain two
N-linked glycosylation sites (at positions that vary somewhat with the nature of the β
chain) and similar alterations can be made at these sites. The CTP extension of hCG
contains four O-linked glycosylation sites, and conservative mutations at the serine
residues (e.g., conversion of the serine to alanine) destroy these sites. Destruction of the
O-linked glycosylation sites may effect conversion of agonist activity to antagonist
activity.

Finally, alterations in amino acid sequence that are proximal to the N-linked or 
O-linked glycosylation sites influence the nature of the glycosylation that is present on the
resulting molecule and also alter activity. 

Alterations in amino acid sequence also include both insertions and deletions.
Thus, truncated forms of the hormones are included among variants, e.g., mutants of the α
subunit which are lacking some or all of the amino acids at positions 85-92 at the
C-terminus. In addition, α subunits with 1-10 amino acids deleted from the N-terminus
are included. Some useful variants of the hormone quartet described herein are set forth in
U.S. Patent 5,177,193 issued 5 January 1993 and incorporated herein by reference. As
shown therein, the glycosylation patterns can be altered by destroying the relevant sites or,
in the alternative, by choice of host cell in which the protein is produced.

As explained above, the single-chain forms are convenient starting materials for
various engineered muteins. Such muteins include those with non-critical regions altered
or removed. Such deletions and alterations may comprise entire loops, so that sequences
of considerably more than 10 amino acids may be deleted or changed. The single-chain
molecules must, however, retain at least the receptor binding domains and/or the regions
involved in signal transduction.

There is considerable literature on variants of the hormone quartet described herein
and it is clear from this literature that a large number of possible variants which result both
in agonist and antagonist activity can be prepared. Such variants are disclosed, for
example, in Chen, F. et al. Molec Endocrinol (1992) 6:914-919; Yoo, J. et al. J Biol
Chem (1993) 268:13034-13042; Yoo, J. et al. J Biol Chem (1991) 266:17741-17743;
Puett et al. Glycoprotein Hormones, Lustbader, J.W. et al. eds., Springer Verlag New
York (1994) 122-134; Keutmann, H.T. et al. (ibid.) pages 103-117; Erickson, L.D. et al.
Endocrinology (1990) 126:2555-2560; and Bielinska, M. et al. J Cell Biol (1990)
111:330a (Abstract 1844).
As described hereinabove, one method of constructing effective antagonists is to
prepare a single-chain molecule containing two β subunits of the same or different members
of the glycoprotein quartet. Particularly preferred variants of these single-chain forms
include those wherein one or more cystine links are deleted, typically by substituting a
neutral amino acid for one or both cysteines which participate in the link. Particularly
preferred cystine links which may be deleted are those between positions 26 and 110 and
between positions 23 and 72.

In addition, it has been demonstrated that the β subunits of the hormone quartet
can be constructed in chimeric forms so as to provide biological functions of both
components of the chimera, or, in general, hormones of altered biological function. Thus,
chimeric molecules which exhibit both FSH and LH/CG activities can be constructed as
described by Moyle, Proc Natl Acad Sci (1991) 88:760-764; Moyle, Nature (1994)
368:251-255. As disclosed in these papers, substituting amino acids 101-109 of FSH-β
for the corresponding residues in the CG-β subunit yields an analog with both hCG and
FSH activity.

Although it is recognized that glycosylation pattern has a profound influence on
activity both qualitatively and quantitatively, for convenience the terms FSH, LH, TSH,
and CG β subunits refer to the amino acid sequence characteristic of the peptides, as does
"α subunit." When only the β chain is referred to, the terms will be, for example, FSHβ;
when the heterodimer is referred to, the simple term "FSH" will be used. It will be clear
from the context in what manner the glycosylation pattern is affected by, for example,
recombinant expression host or alteration in the glycosylation sites. Forms of the
glycoprotein with specified glycosylation patterns will be so noted.

As used herein "peptide" and "protein" are used interchangeably, since the length
distinction between them is arbitrary.

As stated above, the subunits employed in forming the single-chain conjugates with
or without linking moieties may represent the complete amino acid sequences of the
subunits or only portions thereof. Single-chain conjugates of α and β subunits are
composed of these subunits per se or of those fragments of the subunits which result in a
single-chain form with biological activity comparable to that exhibited by the single chain
composed of the corresponding complete subunits.

In the single-chain forms of the present invention, the α and/or β chain may
contain a CTP extension inserted into a noncritical region.

"Noncritical" regions of the α and β subunits are those regions of the molecules
not required for biological activity (including agonist and antagonist activity). In general,
these regions are removed from binding sites, precursor cleavage sites, and catalytic
regions. Regions critical for inducing proper folding, binding to receptors, catalytic
activity and the like should be avoided; similarly, regions which are critical to assure the
three-dimensional conformation of the protein should be avoided. It should be noted that
some of the regions which are critical in the case of the dimer become non-critical in the
single-chain forms since the conformational restriction imposed by the single chain may
obviate the necessity for these regions. The ascertainment of noncritical regions is readily
accomplished by deleting or modifying candidate regions and conducting an appropriate
assay for the desired activity. Regions where modifications result in loss of activity are
critical; regions wherein the alteration results in the same or similar activity (including
antagonist activity) are considered noncritical.

It should be emphasized that by "biological activity" is meant activity which is
either agonistic or antagonistic to that of the native hormones. Thus, certain regions are
critical for behavior of a variant as an antagonist, even though the antagonist is unable to
directly provide the physiological effect of the hormone.

For example, for the α subunit, positions 33-59 are thought to be necessary for
signal transduction and the 20 amino acid stretch at the carboxy terminus is needed for
signal transduction/receptor binding. Residues critical for assembly with the β subunit
include at least residues 33-58, particularly 37-40.

Where the noncritical region is "proximal" to the N- or C-terminus, the insertion is
at any location within 10 amino acids of the terminus, preferably within 5 amino acids, and
most preferably at the terminus per se.

In general, "proximal" is used to indicate a position which is within 10 amino acids,
preferably within five amino acids, of a referent position, and most preferably at the
referent position per se. Thus, certain variants may contain substitutions of amino acids
"proximal" to a glycosylation site; the definition is relevant here. In addition, the α and β
subunits may be linked to each other at positions "proximal" to their N- or C-termini.
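
The definition of "proximal" reduces to a distance test on residue numbers. The short Python sketch below is offered only as an illustration and is not part of the original disclosure. The function name and the `window` parameter are hypothetical, and the sketch assumes that "within 10 amino acids" includes a distance of exactly 10.

```python
def is_proximal(position: int, referent: int, window: int = 10) -> bool:
    """True if `position` lies within `window` residues of `referent`.

    window=10 is the general definition, window=5 the preferred one, and
    window=0 the most preferred case (the referent position per se).
    """
    if window < 0:
        raise ValueError("window must be non-negative")
    return abs(position - referent) <= window


if __name__ == "__main__":
    # Example: residues near the alpha subunit glycosylation site at position 52.
    for pos in (41, 42, 47, 52):
        print(pos, is_proximal(pos, 52), is_proximal(pos, 52, window=5))
```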

As used herein, the "CTP unit" refers to an amino acid sequence found at the
carboxy terminus of human chorionic gonadotropin β subunit which extends from amino
acid 112-118 to residue 145 at the C-terminus or to a portion thereof. Thus, each
"complete" CTP unit contains 28-34 amino acids, depending on the N-terminus of the
CTP. The native sequence of positions 112-145 is shown in Figure 2.
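
The stated range of 28-34 amino acids follows directly from the permitted start positions. As a minimal illustration, not part of the original disclosure, the following Python sketch computes the length of a "complete" CTP unit for each start position from 112 to 118, counting both end residues. The function and constant names are hypothetical.

```python
CTP_END = 145                  # C-terminal residue of the hCG beta subunit
CTP_START_RANGE = (112, 118)   # permitted N-terminal positions of a complete CTP unit


def complete_ctp_length(start: int) -> int:
    """Number of residues in a complete CTP unit beginning at `start`."""
    lo, hi = CTP_START_RANGE
    if not lo <= start <= hi:
        raise ValueError(f"start must be between {lo} and {hi}")
    return CTP_END - start + 1  # inclusive of both end residues


if __name__ == "__main__":
    for start in range(CTP_START_RANGE[0], CTP_START_RANGE[1] + 1):
        print(f"{start}-{CTP_END}: {complete_ctp_length(start)} residues")
```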

By a "partial" CTP unit is meant an amino acid sequence which occurs between
positions 112-118 to 145 inclusive, but which has at least one amino acid deleted from the
shortest possible "complete" CTP unit (i.e. from positions 118-145). The "partial" CTP
units included in the invention preferably contain at least one O-glycosylation site if
agonist activity is desired. Some nonglycosylated forms of the hormones are antagonists
and are useful as such. The CTP unit contains four such sites at the serine residues at
positions 121 (site 1); 127 (site 2); 132 (site 3); and 138 (site 4). The partial forms of
CTP useful in agonists of the invention will contain one or more of these sites arranged in
the order in which they appear in the native CTP sequence. Thus, the "partial" CTP unit
employed in agonists of the invention may include all four glycosylation sites; sites 1, 2
and 3; sites 1, 2 and 4; sites 1, 3 and 4; sites 2, 3 and 4; or simply sites 1 and 2; 1 and 3; 1
and 4; 2 and 3; 2 and 4; or 3 and 4; or may contain only one of sites 1, 2, 3 or 4.
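
The combinations of sites listed above are simply the non-empty subsets of the four O-glycosylation sites, kept in native order. The following Python sketch, which is illustrative and not part of the original disclosure, enumerates them with the standard library. The names `O_SITES` and `site_combinations` are hypothetical.

```python
from itertools import combinations

# Site number -> serine position in the hCG beta subunit, as stated in the text.
O_SITES = {1: 121, 2: 127, 3: 132, 4: 138}


def site_combinations():
    """All non-empty subsets of the four sites, largest first, in native order."""
    subsets = []
    for k in range(len(O_SITES), 0, -1):
        subsets.extend(combinations(sorted(O_SITES), k))
    return subsets


if __name__ == "__main__":
    for subset in site_combinations():
        positions = [O_SITES[s] for s in subset]
        print(f"sites {subset} -> serine positions {positions}")
```

The enumeration yields one set of four sites, four sets of three, six pairs and four single sites, fifteen in all, matching the list given in the text.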

By "tandem" inserts or extensions is meant that the insert or extension contains at
least two "CTP units". Each CTP unit may be complete or a fragment, and native or a
variant. All of the CTP units in the tandem extension or insert may be identical, or they
may be different from each other. Thus, for example, the tandem extension or insert may
generically be partial-complete; partial-partial; partial-complete-partial;
complete-complete-partial, and the like wherein each of the noted partial or complete CTP units may
independently be either a variant or the native sequence.

The "linker moiety" is a moiety that joins the α and β sequences without interfering
with the activity that would otherwise be exhibited by the same α and β chains as members
of a heterodimer, or which alters that activity to convert it from agonist to antagonist
activity. The level of activity may change within a reasonable range, but the presence of
the linker cannot be such as to deprive the single-chain form of both substantial agonist
and substantial antagonist activity. The single-chain form must remain as a single-chain
form when it is recovered from its production medium and must exhibit activity pertinent
to the hormonal activity of the heterodimer, the elements of which form its components.

Variants 

The hormone subunits and the CTP units may correspond exactly to the native
hormone or CTP sequence, or may be variants. The nature of the variants has been
defined hereinabove. In such variants, 1-10, preferably 1-8, and most preferably 1-5 of the
amino acids contained in the native sequence are substituted by a different amino acid
compared to the native amino acid at that position, or 1-10, more preferably 1-8 and most
preferably 1-5 amino acids are simply deleted, or a combination of these. As pointed out
above, when non-critical regions of the single-chain forms are identified, in particular,
through detecting the presence of non-critical "loops", the number of amino acids altered
by deletion or substitution may be increased to 20 or 30 or any arbitrary number
depending on the length of amino acid sequence in the relevant non-critical region. Of
course, deletion or substitutions in more than one non-critical region results in still greater
numbers of amino acids in the single-chain forms being affected, and substitution and
deletion strategies may be used in combination. The substitutions or deletions taken
cumulatively do not result in substantial elimination of agonist or antagonist activity
associated with the hormone. Substitutions by conservative analogs of the native amino
acid are preferred.

"Conservative analog" means, in the conventional sense, an analog wherein the 
residue substituted is of the same general amino acid category as that for which 
substitution is made. Amino acids have been classified into such groups, as is understood 
in the art, by, for example, Dayhoff, M. et al., Atlas of Protein Sequences and Structure
(1972) 5:89-99. In general, acidic amino acids fall into one group; basic amino acids into 
another; neutral hydrophilic amino acids into another; and so forth. 

More specifically, amino acid residues can be generally subclassified into four 
major subclasses as follows: 

Acidic: The residue has a negative charge due to loss of H ion at physiological pH 
and the residue is attracted by aqueous solution so as to seek the surface positions in the 
conformation of a peptide in which it is contained when the peptide is in aqueous medium 
at physiological pH. 

Basic: The residue has a positive charge due to association with H ion at 
physiological pH and the residue is attracted by aqueous solution so as to seek the surface 
positions in the conformation of a peptide in which it is contained when the peptide is in 
aqueous medium at physiological pH. 

Neutral/nonpolar: The residues are not charged at physiological pH and the 
residue is repelled by aqueous solution so as to seek the inner positions in the 
conformation of a peptide in which it is contained when the peptide is in aqueous medium. 
These residues are also designated "hydrophobic" herein.

Neutral/polar: The residues are not charged at physiological pH, but the residue is
attracted by aqueous solution so as to seek the outer positions in the conformation of a 
peptide in which it is contained when the peptide is in aqueous medium. 

It is understood, of course, that in a statistical collection of individual residue
molecules some molecules will be charged, and some not, and there will be an attraction
for or repulsion from an aqueous medium to a greater or lesser extent. To fit the
definition of "charged," a significant percentage (at least approximately 25%) of the
individual molecules are charged at physiological pH. The degree of attraction or
repulsion required for classification as polar or nonpolar is arbitrary and, therefore, amino
acids specifically contemplated by the invention have been classified as one or the other.
Most amino acids not specifically named can be classified on the basis of known behavior.

Amino acid residues can be further subclassified as cyclic or noncyclic, and
aromatic or nonaromatic, self-explanatory classifications with respect to the side chain
substituent groups of the residues, and as small or large. The residue is considered small if
it contains a total of 4 carbon atoms or less, inclusive of the carboxyl carbon. Small
residues are, of course, always nonaromatic.

For the naturally occurring protein amino acids, subclassification according to the 
foregoing scheme is as follows. 

Acidic: Aspartic acid and Glutamic acid;
Basic/noncyclic: Arginine, Lysine;
Basic/cyclic: Histidine;
Neutral/polar/small: Glycine, Serine, Cysteine;
Neutral/nonpolar/small: Alanine;
Neutral/polar/large/nonaromatic: Threonine, Asparagine, Glutamine;
Neutral/polar/large/aromatic: Tyrosine;
Neutral/nonpolar/large/nonaromatic: Valine, Isoleucine, Leucine, Methionine;
Neutral/nonpolar/large/aromatic: Phenylalanine, and Tryptophan.

The gene-encoded secondary amino acid proline, although technically within the
group neutral/nonpolar/large/cyclic and nonaromatic, is a special case due to its known
effects on the secondary conformation of peptide chains, and is not, therefore, included in
this defined group.
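
The subclassification above can be read as a simple lookup table. The following is a minimal sketch in Python, not part of the original disclosure; the names RESIDUE_CLASSES, classify and same_class are hypothetical and chosen only for illustration. Proline is deliberately left out, as the text excludes it from the defined groups.

```python
# Residue classes exactly as listed in the text (three-letter codes).
# Proline is intentionally absent: the text treats it as a special case.
RESIDUE_CLASSES = {
    "acidic": ["Asp", "Glu"],
    "basic/noncyclic": ["Arg", "Lys"],
    "basic/cyclic": ["His"],
    "neutral/polar/small": ["Gly", "Ser", "Cys"],
    "neutral/nonpolar/small": ["Ala"],
    "neutral/polar/large/nonaromatic": ["Thr", "Asn", "Gln"],
    "neutral/polar/large/aromatic": ["Tyr"],
    "neutral/nonpolar/large/nonaromatic": ["Val", "Ile", "Leu", "Met"],
    "neutral/nonpolar/large/aromatic": ["Phe", "Trp"],
}

# Inverted index: residue -> class name.
CLASS_OF = {
    residue: name
    for name, members in RESIDUE_CLASSES.items()
    for residue in members
}


def classify(residue):
    """Return the class of a residue, or None if it is not in the scheme."""
    return CLASS_OF.get(residue)


def same_class(a, b):
    """True when both residues are classified and fall in the same class."""
    class_a, class_b = classify(a), classify(b)
    return class_a is not None and class_a == class_b
```

For example, same_class("Val", "Leu") is true, same_class("Asp", "Lys") is false, and classify("Pro") returns None.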

If the single-chain proteins of the invention are constructed by recombinant
methods, they will contain only gene encoded amino acid substitutions; however, if any
portion is synthesized by standard, for example, solid phase, peptide synthesis methods
and ligated, for example, enzymatically, into the remaining protein, non-gene encoded
amino acids, such as aminoisobutyric acid (Aib), phenylglycine (Phg), and the like can also
be substituted for their analogous counterparts.

These non-encoded amino acids also include, for example, β-alanine (β-Ala), or
other omega-amino acids, such as 3-aminopropionic, 4-aminobutyric and so forth,
sarcosine (Sar), ornithine (Orn), citrulline (Cit), t-butylalanine (t-BuA), t-butylglycine
(t-BuG), N-methylisoleucine (N-MeIle), and cyclohexylalanine (Cha), norleucine (Nle),
cysteic acid (Cya); 2-naphthylalanine (2-Nal); 1,2,3,4-tetrahydroisoquinoline-3-carboxylic
acid (Tic); mercaptovaleric acid (Mvl); β-2-thienylalanine (Thi); and methionine sulfoxide
(MSO). These also fall conveniently into particular categories.

Based on the above definitions,
Sar and β-Ala and Aib are neutral/nonpolar/small;
t-BuA, t-BuG, N-MeIle, Nle, Mvl and Cha are neutral/nonpolar/large/nonaromatic;
Orn is basic/noncyclic;
Cya is acidic;
Cit, Acetyl Lys, and MSO are neutral/polar/large/nonaromatic; and
Phg, Nal, Thi and Tic are neutral/nonpolar/large/aromatic.

The various omega-amino acids are classified according to size as
neutral/nonpolar/small (β-Ala, i.e., 3-aminopropionic, 4-aminobutyric) or large (all
others).

Thus, amino acid substitutions other than those encoded in the gene can also be
included in peptide compounds within the scope of the invention and can be classified
within this general scheme according to their structure.

Preferred Embodiments of the Single-Chain Hormones 

The single-chain hormones of the invention are most efficiently and economically 
produced using recombinant techniques. Therefore, those forms of α and β chains, CTP
units and other linker moieties which include only gene-encoded amino acids are 
preferred. It is possible, however, as set forth above, to construct at least portions of the 
single-chain hormones using synthetic peptide techniques or other organic synthesis 
techniques and therefore variants which contain nongene-encoded amino acids are also 
within the scope of the invention. 

In the most preferred embodiments of the single-chain hormones of the invention,
the C-terminus of the β subunit is covalently linked, optionally through a linker, to the
N-terminus of the mature α subunit; forms wherein the C-terminus of the α subunit is
linked to the N-terminus of the β subunit are also useful, but may have less activity either
as antagonists or agonists of the relevant receptor. The linkage can be a direct peptide
linkage wherein the C-terminal amino acid of one subunit is directly linked through the
peptide bond to the N-terminus of the other; however, in many instances it is preferable to
include a linker moiety between the two termini. In many instances, the linker moiety will
provide at least one β turn between the two chains. The presence of proline residues in
the linker may therefore be advantageous.

As described above, the N-terminus of the α chain may also be coupled to the
N-terminus of the β chain or the C-terminus of the α to the C-terminus of the β chain, in
any case through a linker unit.

It should be understood that in discussing linkages between the termini of the 
subunits comprising the single chain forms, one or more termini may be altered by 
substitution and/or deletion as described above.

While the head-to-head, tail-to-tail and head-to-tail configurations of the
single-chain heterodimer have been described, the linkage between the two subunits may also
occur at positions not precisely at the N- or C-terminus of each member but at positions
proximal thereto.

In one particularly preferred set of embodiments, the linkage is head-to-tail and the 
linker moiety will include one or more CTP units and/or variants or truncated forms
thereof. Preferred forms of the CTP units used in such linker moieties are described
hereinbelow. 

Further, the linker moiety may include a drug covalently, preferably releasably, 
bound to the linker moiety. Means for coupling the drug to the linker moiety and for 
providing for its release are conventional. 

In addition to their occurrence in the linker moiety, CTP and its variants and 
truncations may also be included in any noncritical region of the subunits making up the
single-chain hormone. The nature of these inclusions, and their positions, is set forth in 
detail in the parent application herein. 

While CTP units are preferred inclusions in the linker moiety, it is understood that 
the linker may be any suitable covalently bound material which provides the appropriate 
spatial relationship between the α and β subunits. Thus, for head-to-tail configurations the
linker may generally be a peptide comprising an arbitrary number, but typically less than
100, more preferably less than 50, amino acids which has the proper
hydrophilicity/hydrophobicity ratio to provide the appropriate spacing and conformation in
solution. In general, the linker should be on balance hydrophilic so as to reside in the
surrounding solution and out of the way of the interaction between the α and β subunits.
It is preferable that the linker include β turns typically provided by proline residues. Any
suitable polymer, including peptide linkers, with the above-described correct 
characteristics may be used. 
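
The linker criteria stated above (length, overall hydrophilicity, presence of proline) can be checked mechanically for a candidate peptide. The sketch below is illustrative only and rests on two assumptions that are not in the text: the Kyte-Doolittle hydropathy scale is used as a stand-in for the unspecified hydrophilicity/hydrophobicity measure, and the test sequence is residues 118-145 of the human CG β subunit as recalled from published sequence data (Figure 2 is authoritative). The function name linker_report is hypothetical.

```python
# ASSUMPTION: Kyte-Doolittle hydropathy values stand in for the
# hydrophilicity/hydrophobicity measure, which the text does not specify.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# ASSUMPTION: CTP residues 118-145, recalled from published data.
CTP_118_145 = "SSSSKAPPPSLPSPSRLPGPSDTPILPQ"


def linker_report(seq, max_len=100):
    """Summarize how a peptide linker (one-letter codes) meets the criteria."""
    mean_hydropathy = sum(KYTE_DOOLITTLE[r] for r in seq) / len(seq)
    return {
        "length_ok": len(seq) < max_len,       # "typically less than 100"
        "hydrophilic": mean_hydropathy < 0,    # negative mean = hydrophilic
        "has_proline": "P" in seq,             # proline can supply turns
        "mean_hydropathy": round(mean_hydropathy, 2),
    }


report = linker_report(CTP_118_145)
```

Under these assumptions the CTP segment passes all three checks, whereas a 120-residue polyleucine stretch fails both the length and the hydrophilicity check. A negative mean hydropathy is only a rough proxy for "on balance hydrophilic"; it says nothing about conformation.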

One particular linker moiety that is not included within the scope of the invention 
is that which includes a signal peptide immediately upstream of the downstream subunit.

Particularly preferred embodiments of the single-chain hormones of the invention
include:

βFSH-α;
βLH-α;
βTSH-α;
βCG-α;
βFSH-CTP-α;
βLH-CTP-α;
βCG-CTP-α;
βFSH-CTP-CTP-α;
βLH-CTP-CTP-α;
βCG-CTP-CTP-α;

and the like. Also particularly preferred are the human forms of the subunits. In the
above constructions, "CTP" refers to CTP or its variants or truncations as further
explained in the paragraph below.



Preferred Embodiments of CTP Units 

The notation used for the CTP units of the invention is as follows: for portions of
the complete CTP unit, the positions included in the portion are designated by their
number as they appear in Figure 2 herein. Where substitutions occur, the substituted
amino acid is provided along with a superscript indicating its position. Thus, for example,
CTP (120-143) represents that portion of CTP extending from positions 120 to 143; CTP
(120-130; 136-143) represents a fused amino acid sequence lacking positions 118-119,
131-135, and 144-145 of the native sequence. CTP (Arg¹²²) refers to a variant wherein
the lysine at position 122 is substituted by an arginine; CTP (Ile¹³⁴) refers to a variant
wherein the leucine at position 134 is substituted by isoleucine. CTP (Val¹²⁸ Val¹⁴²)
represents a variant wherein two substitutions have been made, one for the leucine at
position 128 and the other for the isoleucine at position 142. CTP (120-143; Ile^^^ Ala'^°)
represents the relevant portion of the CTP unit where the two indicated substitutions have
been made.
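
The notation can be illustrated by expanding it into a sequence. The sketch below is a hedged illustration, not part of the disclosure: the native sequence for positions 118-145 is an assumption recalled from published data on the human CG β subunit (Figure 2 is authoritative), and the function name ctp_variant is hypothetical. The assumed sequence does agree with the residues named in the text (Lys at 122, Leu at 128 and 134, Ile at 142).

```python
# ASSUMPTION: native CTP residues 118-145 in one-letter code, recalled from
# published sequence data; Figure 2 of this document is authoritative.
CTP_START = 118
CTP_NATIVE = "SSSSKAPPPSLPSPSRLPGPSDTPILPQ"


def ctp_variant(ranges, substitutions=None):
    """Expand the CTP notation into a sequence.

    ranges        -- inclusive position ranges, e.g. [(120, 130), (136, 143)]
    substitutions -- position -> one-letter residue, e.g. {122: "R"}
    Substitutions at positions outside the ranges have no effect.
    """
    substitutions = substitutions or {}
    residues = []
    for first, last in ranges:
        for position in range(first, last + 1):
            native = CTP_NATIVE[position - CTP_START]
            residues.append(substitutions.get(position, native))
    return "".join(residues)


# CTP (120-130; 136-143): positions 118-119, 131-135 and 144-145 are absent.
fused = ctp_variant([(120, 130), (136, 143)])

# CTP (Arg122): lysine at position 122 replaced by arginine.
arg122 = ctp_variant([(118, 145)], {122: "R"})
```

Here fused contains 19 residues (11 from the first range and 8 from the second), and arg122 differs from the native segment only at position 122.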

Also preferred among variants of CTP are those wherein one or more of the 
O-linked glycosylation sites have been altered or deleted. One particularly preferred
means of altering the site to prevent glycosylation is substitution of an alanine residue for 
the serine residue in these sites. 

Particularly preferred are those CTP units of the following formulas:

#1   CTP (116-132)
#2   CTP (118-128; 130-135)
#3   CTP (117-142)
#4   CTP (116-130)
#5   CTP (116-123; 137-145)
#6   CTP (115-133; 141-145)
#7   CTP (117-140, Ser'^^ Gln'^°)
#8   CTP (125-143, Ala'^°)
#9   CTP (135-145, Glu'^^)
#10  CTP (131-143, Val"^ Val'*^)
#11  CTP (118-132)
#12  CTP (118-127)
#13  CTP (118-145)
#14  CTP (115-132)
#15  CTP (115-127)
#16  CTP (115-145)
#17  CTP (112-145)
#18  CTP (112-132)
#19  CTP (112-127)

Preferred Embodiments of the α and β Subunits

Of course, the native forms of the α and β subunits in the single-chain form are
among the preferred embodiments. However, certain variants are also preferred. 

In particular, variants of the α subunit in which the N-linked glycosylation site at
position 52 is eliminated or altered by amino acid substitutions at or proximal to this site
are preferred for antagonist activity. Similar modifications at the glycosylation site at
position 78 are also preferred. Deletion of one or more amino acids at positions 85-92
also affects the nature of the activity of hormones containing the α subunit and
substitution or deletion of amino acids at these positions is also among the preferred
embodiments.

Similarly, the N-linked glycosylation sites in the β chain can conveniently be
modified to eliminate glycosylation and thus affect the agonist or antagonist activity of the
β chains. If CTP is present, either natively as in CG or by virtue of being present as a
linker, the O-linked glycosylation sites in this moiety may also be altered.

Particular variants containing modified or deleted glycosylation sites are set forth
in Yoo, J. et al. J Biol Chem (1993) 268:13034-13042; Yoo, J. et al. J Biol Chem (1991)
266:17741-17743; and Bielinska, M. et al. J Cell Biol (1990) 111:330a (all cited above)
and in Matzuk, M.M. et al. J Biol Chem (1989) 264:2409-2414; Keene, J.L. et al. J Biol
Chem (1989) 264:4769-4775; and Keene, J.L. et al. Mol Endocrinol (1989) 3:2011-2017.

Not only may the glycosylation sites per se be modified directly, but positions
proximal to these sites are preferentially modified so that the glycosylation status of the
mutant will be affected. For the α subunit, for example, variants in which amino acids
between positions 50-60 are substituted, including both conservative and nonconservative
substitutions, are favored, especially substitutions at positions 51, 53 and 55 because of
their proximity to the glycosylation site at Asn52.
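
The N-linked sites discussed above follow the familiar Asn-X-Ser/Thr motif (X other than proline), so their positions, and the effect of a substitution on them, can be checked in a few lines. The sketch below is illustrative only. The 92-residue mature human α subunit sequence is an assumption recalled from published data and should be verified against a sequence database; the helper names and the particular replacement residues (Gln at 52, Ala at 54) are hypothetical choices, not substitutions taught by the text.

```python
import re

# ASSUMPTION: mature human alpha subunit (92 residues), recalled from
# published sequence data; verify before relying on it.
ALPHA_MATURE = (
    "APDVQDCPECTLQENPFFSQPGAPILQCMGCCFSRAYPTPLRSKKTMLVQ"
    "KNVTSESTCCVAKSYNRVTVMGGFKVENHTACHCSTCYYHKS"
)


def n_glyc_sites(seq):
    """Return 1-based positions of Asn in an N-X-S/T motif (X not Pro)."""
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", seq)]


def substitute(seq, position, residue):
    """Return seq with the residue at a 1-based position replaced."""
    return seq[: position - 1] + residue + seq[position:]


native_sites = n_glyc_sites(ALPHA_MATURE)

# Hypothetical substitutions that remove the motif at position 52:
without_asn52 = n_glyc_sites(substitute(ALPHA_MATURE, 52, "Q"))
without_thr54 = n_glyc_sites(substitute(ALPHA_MATURE, 54, "A"))
```

With the assumed sequence, the scan finds motifs at positions 52 and 78, matching the sites named in the text, and either substitution leaves only position 78. A motif scan cannot predict the effect of changes at positions 51 or 55, which lie outside the three-residue motif; those effects would have to be determined experimentally.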

Also preferred are mutants of the α subunit wherein lysine at position 91 is
converted to methionine or glutamic acid.

Although the variants have been discussed in terms of variations in the individual
subunits hereinabove, it will be recalled that the single chain forms of the dimer offer
additional opportunities for modification. Specifically, regions that are critical to folding
of the dimer may not be critical to the correct conformation of the single chain molecule
and these regions are available for variation in the single chain form, although not
described above in terms of individual members of the dimeric forms. Further, the single
chain forms may be modified dramatically in the context of non-critical regions whose
alteration and/or deletion do not affect the biological activity as described above.

While for human use, the human forms of the glycoprotein quartet are desirable, it
should be noted that the corresponding forms in other vertebrates are useful in veterinary
contexts. Thus, the FSH, TSH and LH subunits characteristic of bovine, ovine, equine,
porcine, feline, canine, and other species are appropriate to indications affecting these
species per se.



Suitable Drugs

Suitable drugs that may be included in the linker moiety include peptides or
proteins such as insulin-like growth factors; epidermal growth factors; acidic and basic
fibroblast growth factors; platelet-derived growth factors; the various colony stimulating
factors, such as granulocyte CSF, macrophage-CSF, and the like; as well as the various
cytokines such as IL-2, IL-3 and the plethora of additional interleukin proteins; the various
interferons; tumor necrosis factor; and the like. Peptide- or protein-based drugs have the
advantage that they can be included in the single-chain and the entire construct can readily
be produced by recombinant expression of a single gene. Also, small molecule drugs such
as antibiotics, antiinflammatories, toxins, and the like can be used.

In general, the drugs included within the linker moiety will be those desired to act
in the proximity of the receptors to which the hormones ordinarily bind. Suitable
provision for release of the drug from inclusion within the linker will be provided, for
example, by also including sites for enzyme-catalyzed lysis as further described under the
section headed Preparation Methods hereinbelow.

Other Modifications 

The single-chain proteins of the invention may be further conjugated or derivatized
in ways generally understood to derivatize amino acid sequences, such as phosphorylation,
glycosylation, deglycosylation of ordinarily glycosylated forms, modification of the amino
acid side chains (e.g., conversion of proline to hydroxyproline) and similar modifications
analogous to those post-translational events which have been found to occur generally.

The glycosylation status of the hormones of the invention is particularly important.
The hormones may be prepared in nonglycosylated form either by producing them in
procaryotic hosts or by mutating the glycosylation sites normally present in the subunits
and/or any CTP units that may be present. Both nonglycosylated versions and partially
glycosylated versions of the hormones can be prepared by manipulating the glycosylation
sites. Normally glycosylated versions are, of course, also included within the scope of the
invention.

As is generally known in the art, the single-chain proteins of the invention may also
be coupled to labels, carriers, solid supports, and the like, depending on the desired
application. The labeled forms may be used to track their metabolic fate; suitable labels
for this purpose include, especially, radioisotope labels such as iodine 131, technetium 99,
indium 111, and the like. The labels may also be used to mediate detection of the
single-chain proteins in assay systems; in this instance, radioisotopes may also be used as well as
enzyme labels, fluorescent labels, chromogenic labels, and the like. The use of such labels
is particularly helpful for these proteins since, as receptor ligands, they are targeting agents.

The proteins of the invention may also be coupled to carriers to enhance their
immunogenicity in the preparation of antibodies specifically immunoreactive with these
new modified forms. Suitable carriers for this purpose include keyhole limpet hemocyanin
(KLH), bovine serum albumin (BSA), diphtheria toxoid, and the like. Standard
coupling techniques for linking the modified peptides of the invention to carriers, including
the use of bifunctional linkers, can be employed.

Similar linking techniques, along with others, may be employed to couple the 
proteins of the invention to solid supports. When coupled, these proteins can then be used 
as affinity reagents for the separation of desired components with which specific reaction
is exhibited. 



Preparation Methods 

Methods to construct the proteins of the invention are well known in the art. As
set forth above, if only gene encoded amino acids are included, and the single-chain is in a
head-to-tail configuration, the most practical approach at present is to synthesize these
materials recombinantly by expression of the DNA encoding the desired protein. DNA
containing the nucleotide sequence encoding the single-chain forms, including variants,
can be prepared from native sequences. Techniques for site-directed mutagenesis, ligation
of additional sequences, PCR, and construction of suitable expression systems are all, by
now, well known in the art. Portions or all of the DNA encoding the desired protein can
be constructed synthetically using standard solid phase techniques, preferably to include
restriction sites for ease of ligation. Suitable control elements for transcription and
translation of the included coding sequence can be provided to the DNA coding
sequences. As is well known, expression systems are now available compatible with a
wide variety of hosts, including procaryotic hosts such as bacteria and eucaryotic hosts
such as yeast, plant cells, insect cells, mammalian cells, avian cells, and the like.

The choice of host is particularly pertinent to posttranslational events, most particularly
including glycosylation. The location of glycosylation is mostly controlled by the nature of
the glycosylation sites within the molecule; however, the nature of the sugars occupying
this site is largely controlled by the nature of the host. Accordingly, a fine-tuning of the
properties of the hormones of the invention can be achieved by proper choice of host.

A particularly preferred form of gene for the α subunit portion, whether the α
subunit is modified or unmodified, is the "minigene" construction.

As used herein, the α subunit "minigene" refers to the gene construction disclosed
in Matzuk, M.M., et al., Mol Endocrinol (1988) 2:95-100, in the description of the
construction of pM²/CGα or pM²/α. This "minigene" is characterized by retention only of
the intron sequence between exon 3 and exon 4, all upstream introns having been deleted.
In the particular construction described, the N-terminal coding sequences which are
derived from exon 2 and a portion of exon 3 are supplied from cDNA and are ligated
directly through an XbaI restriction site into the coding sequence of exon 3 so that the
introns between exons I and II and between exons II and III are absent. However, the
intron between exons III and IV as well as the signals 3' of the coding sequence are
retained. The resulting minigene can conveniently be inserted as a BamHI/BglII segment.
Other means for construction of a comparable minigene are, of course, possible and the
definition is not restricted to the particular construction wherein the coding sequences are
ligated through an XbaI site. However, this is a convenient means for the construction of
the gene, and there is no particular advantage to other approaches, such as synthetic or
partially synthetic preparation of the gene. The definition includes those coding sequences
for the α subunit which retain the intron between exons III and IV, or any other intron, and
preferably no other introns.

For recombinant production, modified host cells using expression systems are used
and cultured to produce the desired protein. These terms are used herein as follows:

A "modified" recombinant host cell, i.e., a cell "modified to contain" the
recombinant expression systems of the invention, refers to a host cell which has been
altered to contain this expression system by any convenient manner of introducing it,
including transfection, viral infection, and so forth. "Modified" refers to cells containing
this expression system whether the system is integrated into the chromosome or is
extrachromosomal. The "modified" cells may either be stable with respect to inclusion of
the expression system or not. In short, "modified" recombinant host cells with the
expression system of the invention refers to cells which include this expression system as a
result of their manipulation to include it, when they natively do not, regardless of the
manner of effecting this incorporation.

"Expression system" refers to a DNA molecule which includes a coding nucleotide
sequence to be expressed and those accompanying control sequences necessary to effect
the expression of the coding sequence. Typically, these controls include a promoter,
termination regulating sequences, and, in some cases, an operator or other mechanism to
regulate expression. The control sequences are those which are designed to be functional
in a particular target recombinant host cell and therefore the host cell must be chosen so as
to be compatible with the control sequences in the constructed expression system.

If secretion of the protein produced is desired, additional nucleotide sequences 
encoding a signal peptide are also included so as to produce the signal peptide operably 
linked to the desired single-chain hormone to produce the preprotein. Upon secretion, the 
signal peptide is cleaved to release the mature single-chain hormone. 

As used herein "cells," "cell cultures," and "cell lines" are used interchangeably
without particular attention to nuances of meaning. Where the distinction between them is
important, it will be clear from the context. Where any can be meant, all are intended to
be included.

The protein produced may be recovered from the lysate of the cells if produced 
intracellularly, or from the medium if secreted. Techniques for recovering recombinant
proteins from cell cultures are well understood in the art, and these proteins can be 
purified using known techniques such as chromatography, gel electrophoresis, selective 
precipitation, and the like. 

All or a portion of the hormones of the invention may be synthesized directly using
peptide synthesis techniques known in the art. Synthesized portions may be ligated, and
release sites for any drug contained in the linker moiety introduced by standard chemical
means. For those embodiments which contain amino acids which are not encoded by the
gene and those embodiments wherein the head-to-head or tail-to-tail configuration is
employed, of course, the synthesis must be at least partly at the protein level. Head-to-head
junctions at the natural N-termini or at positions proximal to the natural N-termini
may be effected through linkers which contain functional groups reactive with amino
groups, such as dicarboxylic acid derivatives. Tail-to-tail configurations at the C-termini
or positions proximal to the C-termini may be effected through linkers which are diamines,
diols, or combinations thereof.



Antibodies 

The proteins of the invention may be used to generate antibodies specifically
immunoreactive with these new compounds. These antibodies are useful in a variety of
diagnostic and therapeutic applications.

The antibodies are generally prepared using standard immunization protocols in
mammals such as rabbits, mice, sheep or rats, and the antibodies are titered as polyclonal
antisera to assure adequate immunization. The polyclonal antisera can then be harvested
as such for use in, for example, immunoassays. Antibody-secreting cells from the host,
such as spleen cells, or peripheral blood leukocytes, may be immortalized using known
techniques and screened for production of monoclonal antibodies immunospecific with the
proteins of the invention.

By "immunospecific for the proteins" is meant antibodies which are
immunoreactive with the single-chain proteins, but not with the heterodimers per se within
the general parameters considered to determine affinity or nonaffinity. It is understood
that specificity is a relative term, and an arbitrary limit could be chosen, such as a
difference in immunoreactivity of 100-fold or greater. Thus, an immunospecific antibody
included within the invention is at least 100 times more reactive with the single-chain
protein than with the corresponding heterodimers.

By "specifically immunoreactive" is meant that the antibodies react with the single 
chain forms of compounds of the invention and not with other molecules, even closely 
related ones, in measurable degree. Thus, although the antibodies of the invention will
specifically bind the single chain forms, they would bind the corresponding dimer or the
individual subunits to a significantly lesser degree. 



Formulation 

The proteins of the invention are formulated and administered using methods
comparable to those known for the heterodimers corresponding to the single-chain form.
Thus, formulation and administration methods will vary according to the particular
hormone used. However, the dosage level and frequency of administration may be altered
as compared to the heterodimer, especially if CTP units are present, in view of the
extended biological half life due to their presence.

Formulations for proteins of the invention are those typical of protein or peptide 
drugs such as found in Remington's Pharmaceutical Sciences, latest edition, Mack
Publishing Company, Easton, PA. Generally, proteins are administered by injection,
typically intravenous, intramuscular, subcutaneous, or intraperitoneal injection, or using
formulations for transmucosal or transdermal delivery. These formulations generally
include a detergent or penetrant such as bile salts, fusidic acids, and the like. These 
formulations can be administered as aerosols or suppositories or, in the case of 
transdermal administration, in the form of skin patches. 

Oral administration is also possible provided the formulation protects the peptides 
of the invention from degradation in the digestive system.

Optimization of dosage regimen and formulation is conducted as a routine matter 
and as generally performed in the art. 

These formulations can also be modified to include those suitable for veterinary 
use as is generally known in the art. 


Methods of Use 

The single-chain peptides of the invention may be used in many ways, most
evidently as substitutes for the heterodimeric forms of the hormones. Thus, like the
heterodimers, the agonist forms of the single-chain hormones of the invention can be used
in treatment of infertility, as aids in in vitro fertilization techniques, and other therapeutic
methods associated with the native hormones. These techniques are applicable to humans
as well as to other animals. The choice of the single-chain protein in terms of its species
derivation will, of course, depend on the subject to which the method is applied.

The single-chain hormones are also useful as reagents in a manner similar to the 
heterodimers. 

In addition, the single-chain hormones of the invention may be used as diagnostic 
tools to detect the presence or absence of antibodies with respect to the native proteins in 
biological samples. They are also useful as control reagents in assay kits for assessing the 
levels of these hormones in various samples. Protocols for assessing levels of the 
hormones themselves or of antibodies raised against them are standard immunoassay 
protocols commonly known in the art. Various competitive and direct assay methods can 
be used involving a variety of labeling techniques including radio-isotope labeling, 
fluorescence labeling, enzyme labeling and the like. 

The single-chain hormones of the invention are also useful in detecting and 
purifying receptors to which the native hormones bind. Thus, the single-chain hormones 
of the invention may be coupled to solid supports and used in affinity chromatographic 
preparation of receptors or antihormone antibodies. The resulting receptors are 
themselves useful in assessing hormone activity for candidate drugs in screening tests for 
therapeutic and reagent candidates. 

Finally, the antibodies uniquely reactive with the single-chain hormones of the 
invention can be used as purification tools for isolation of subsequent preparations of these 
materials. They can also be used to monitor levels of the single-chain hormones 
administered as drugs. 

The following examples are intended to illustrate but not to limit the invention.

Example 1
Preparation of DNA Encoding CG β-α

Figure 1 shows the construction of an insert for an expression vector wherein the
C-terminus of the β-chain of human CG is linked to the N-terminus of the mature human α
subunit.

As shown in Figure 1, the polymerase chain reaction (PCR) is utilized to fuse the
two subunits between exon 3 of CGβ and exon 2 of the α subunit so that the codon for
the carboxy terminal amino acid of CGβ is fused directly in reading frame to that of the
N-terminal amino acid of the α subunit. This is accomplished by using a hybrid primer to
amplify a fragment containing exon 3 of CGβ wherein the hybrid primer contains a "tail"
encoding the N-terminal sequence of the α subunit. The resulting amplified fragment thus
contains a portion of exon 2 encoding human CGα.

Independently, a hybrid primer encoding the N-terminal sequence of the α subunit
fused to the codons corresponding to the C-terminus of CGβ is used as one of the primers
to amplify the α minigene. The two amplified fragments, each now containing overlapping
portions encoding the other subunit, are together amplified with two additional primers
covering the entire span to obtain the SalI insert.

In more detail, reaction I shows the production of a fragment containing exon 3 of
CGβ and the first four amino acids of the mature α subunit as well as a SalI site 5'-ward of
the coding sequences. It is obtained by amplifying a portion of the CGβ genomic
sequence which is described by Matzuk, M.M. et al. Proc Natl Acad Sci USA (1987)
84:6354-6358; Policastro, P. et al. J Biol Chem (1983) 258:11492-11499.

Primer 1 provides the SalI site and has the sequence:

5'-GGA GGA AGG GTG GTC GAG CTC TCT GGT-3'.
                   SalI

The other primer, primer 2, is complementary to four codons of the α N-terminal
sequence and five codons of the CGβ C-terminal sequence and has the sequence:

5'-CAC ATC AGG AGC | TTG TGG GAG GAT CGG-3'.

The resultant amplified segment which is the product of reaction I thus has a SalI
site 5'-ward of the fused coding region.

In reaction II, an analogous fused coding region is obtained from the α minigene
described hereinabove. Primer 3 is a hybrid primer containing four codons of the β
subunit and five codons of α and has the sequence:

5'-ATC CTC CCA CAA | GCT CCT GAT GTG CAG-3'.

Primer 4 contains a SalI site and is complementary to the extension of α exon 4.
Primer 4 has the sequence:

5'-TGA GTC GAC ATG ATA ATT CAG TGA TTG AAT-3'.

Thus, the products of reactions I and II overlap, and when subjected to PCR in the
presence of primers 1 and 4 yield the desired SalI product as shown in reaction III.
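
The overlap that lets the products of reactions I and II prime one another in reaction III can be checked directly from the hybrid primers. The following Python sketch is illustrative only: the primer strings are transcribed from the text with spaces and the junction marker removed, and the helper names are hypothetical. It takes the reverse complement of primer 3 and reports the stretch it shares with primer 2.

```python
# Illustrative check of the overlap between hybrid primers 2 and 3.
# Sequences are transcribed from the text; helper names are hypothetical.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]

def suffix_prefix_overlap(a: str, b: str) -> str:
    """Return the longest suffix of a that is also a prefix of b."""
    for n in range(min(len(a), len(b)), 0, -1):
        if a[-n:] == b[:n]:
            return a[-n:]
    return ""

PRIMER_2 = "CACATCAGGAGCTTGTGGGAGGATCGG"  # antisense hybrid primer, reaction I
PRIMER_3 = "ATCCTCCCACAAGCTCCTGATGTGCAG"  # sense hybrid primer, reaction II

shared = suffix_prefix_overlap(reverse_complement(PRIMER_3), PRIMER_2)
print(len(shared), shared)  # 24 CACATCAGGAGCTTGTGGGAGGAT
```

Under these assumptions the two hybrid primers are complementary over 24 nucleotides spanning the β/α junction, which is the region through which the two intermediate fragments anneal.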

The amplified fragment containing CGβ exon 3 and the α minigene is inserted into
the SalI site of pM²HA-CGβexon1,2, an expression vector which is derived from pM²
containing CGβ exons 1 and 2 in the manner described by Sachais, B., Snider, R.M.,
Lowe, J., Krause, J. J Biol Chem (1993) 268:2319. pM² containing CGβ exons 1 and 2 is
described in Matzuk, M.M. et al. Proc Natl Acad Sci USA (1987) 84:6354-6358 and Matzuk,
M.M. et al. J Cell Biol (1988) 106:1049-1059.

This expression vector then will produce the single-chain form of human CG wherein
the C-terminus of the β subunit is directly linked to the N-terminus of the α subunit.

Example 2

Production and Activity of the Single-Chain Human CG

The expression vector constructed in Example 1 was transfected into Chinese
hamster ovary (CHO) cells and production of the protein was assessed by
immunoprecipitation of radiolabeled protein on SDS gels. The culture medium was
collected and the bioactivity of the single-chain protein was compared to the heterodimer
in a competitive binding assay with respect to the human LH receptor. In this assay, the
cDNA encoding the entire human LH receptor was inserted into the expression vector
pCMX (Oikawa, J. X-C et al. Mol Endocrinol (1991) 5:759-768). Exponentially growing
293 cells were transfected with this vector using the method of Chen, C. et al. Mol Cell
Biol (1987) 7:2745-2752.

In the assay, the cells expressing human LH receptor (2 x 10^[exponent illegible]/tube) were
incubated with 1 ng of labeled hCG in competition with the sample to be tested at 22°C for 18
hours. The samples were then diluted 5-fold with cold Dulbecco's PBS (2 ml)
supplemented with 0.1% BSA and centrifuged at 800 x g for 15 minutes. The pellets
were washed twice with D's PBS and radioactivity was determined with a gamma counter.
Specific binding was 10-12% of the total labeled (iodinated) hCG added in the absence of
sample. The decrease in label in the presence of sample measures the binding ability in the
sample. In this assay, with respect to the human LH receptor in 293 cells, the wild-type
hCG had an ED50 of 0.47 ng and the single-chain protein had an ED50 of 1.1 ng.

In an additional assay for agonist activity, stimulation of cAMP production was
assessed. In this case, 293 cells expressing human LH receptors (2 x 10^[exponent
illegible]/tube) were incubated with varying concentrations of the heterodimeric hCG or
single-chain hCG and cultured for 18 hours. The extracellular cAMP levels were determined
by specific radioimmunoassay as described by Davoren, J.B. et al. Biol Reprod (1985)
33:37-52. In this assay, the wild-type had an ED50 of 0.6 ng/ml and the single-chain form
had an ED50 of 1.7 ng/ml. (ED50 is the dose that produces 50% of the maximal effect.)

Thus, in all cases, the behavior of both the wild-type and single-chain forms is
similar.
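
The degree of similarity can be made concrete by expressing each single-chain ED50 as a multiple of the wild-type value. The short Python sketch below uses only the four ED50 figures reported in this Example; the dictionary layout and function name are illustrative assumptions.

```python
# Fold difference in ED50 (single-chain relative to wild-type hCG).
# Values are those reported in Example 2; names are illustrative.
ED50 = {
    "receptor binding (ng)": {"wild_type": 0.47, "single_chain": 1.1},
    "cAMP stimulation (ng/ml)": {"wild_type": 0.6, "single_chain": 1.7},
}

def fold_difference(wild_type: float, single_chain: float) -> float:
    """Ratio of single-chain ED50 to wild-type ED50 (higher = less potent)."""
    return single_chain / wild_type

for assay, values in ED50.items():
    print(f"{assay}: {fold_difference(**values):.1f}-fold")
```

On these figures the single-chain form requires roughly 2.3 times the wild-type dose in the binding assay and roughly 2.8 times in the cAMP assay, that is, the same order of magnitude in both.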

Example 3 

Additional Activity Assays

The medium from CHO cells transfected with an expression vector for the βFSH-CTP-α
single-chain construct was recovered and assayed as described in Example 2. The
results of the competition assay for binding to FSH receptor are shown in Figure 3. The
results indicate that the single-chain form is more effective than either wild-type FSH or
FSH containing a CTP extension at the β chain in inhibiting binding of FSH itself to the
receptor. The ED50 for the single-chain form is approximately 50 mIU/ml while the ED50
for the extended heterodimer is somewhat over 100 mIU/ml. That for wild-type FSH is
about 120 mIU/ml.

The results of the signal transduction assay are shown in Figure 4. The
effectiveness of all three types of FSH is comparable.

Example 4 

Construction of Additional Expression Vectors

In a manner similar to that set forth in Example 1, expression vectors for the
production of single-chain FSH, TSH and LH (βFSH-α, βFSH-CTP-α, βTSH-α,
βTSH-CTP-α, βLH-α, βLH-CTP-α) are prepared and transfected into CHO cells. The
resulting hormones show activities similar to those of the wild-type form when assayed as
set forth in Example 2.

The following documents are cited in the examples set forth below:

37. Moyle, W.R. et al. J Biol Chem (1975) 250:9163-9169.

54. Campbell, R.K. et al. Mol Cell Endocrinol (1992) 83:195-200.

64. Campbell, R.K. et al. Proc Natl Acad Sci USA (1991) 88:760-764.

65. Skaf, R. et al. Endocrinology (1985) 117:106-113.

Single chain gonadotropins with lutropin and/or follitropin activity.

Example 5 

Preparation and use of Analog #1 (c.f. Table 1), a single chain gonadotropin
with lutropin activity. (See Figure 5)

The coding sequences for analog #1 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared starting with the coding
sequences for the hCG β-subunit and the human α-subunit. These can be cloned from a
human placental cDNA library. The sequences encoding the signal peptide from the
human α-subunit are deleted and the coding sequences for the proteins are spliced
together using the SOEing technique (63) as follows: Primer #1 (100 ng) having the
sequence 5'-
ATGAAATCGACGGAATCAGACTCGAGCCAAGGATGGAGATGTTCCAGGGGCTGCT-3'
and primer #2 (100 ng) having the sequence 3'-
GGGAGCCTGTGGGGCTAGGAGGGGGTTCCTAGGCCATCGCCTAGACCATCG-5'
are mixed with the hCG β-subunit cDNA (1 µg) which serves as a template and PCR is
performed for 25 temperature cycles of 94°C (30 seconds), 50°C (60 seconds), 72°C (60
seconds) using Pfu DNA polymerase purchased from Stratagene, La Jolla, CA and
deoxynucleotide triphosphates and PCR buffer as described (63). Primer #3 (100 ng)
having the sequence 5'-
GGATCCGGTAGCGGATCTGGTAGCGCTCCTGATGTGCAGGATTGCCCA-3' and
primer #4 (100 ng) having the sequence 3'-
ACGTCATGAACAATAATAGTGTTTAGAATTCCATGGCCTAGGTAGAGTTCGATTAGGCCT-5'
are mixed with human α-subunit cDNA (1 µg) which serves as a template
and PCR is performed for 25 temperature cycles of 94°C (30 seconds), 50°C (60
seconds), 72°C (60 seconds) using Pfu DNA polymerase and deoxynucleotide
triphosphates and PCR buffer as described (63). These two PCR reactions give products
that serve as intermediate templates in a third (final) PCR reaction that gives the desired
constructs in a form suitable for cloning. The final PCR reaction is performed by mixing 1
µl of the products from the first two PCR reactions along with primer #5 having the
sequence 5'-ATGAAATCGACGGAATCAGACTCGAGCCAAGG-3' and primer #6
having the sequence 3'-ATTCCATGGCCTAGGTAGAGTTCGATTAGGCCT-5' for 25
temperature cycles of 94°C (30 seconds), 50°C (60 seconds), 72°C (60 seconds) using Pfu
DNA polymerase, additional deoxynucleotide triphosphates, and PCR buffer. The final
PCR product is digested with restriction enzymes XhoI and BglII and ligated into pSVL
(an expression vector obtained from Pharmacia, Piscataway, NJ) that has been digested
with XhoI and BamHI to create a vector that will direct the synthesis of Analog 1. The
XhoI site of the PCR product will ligate to the XhoI site of pSVL and the BglII site of the
PCR product will ligate to the BamHI site of pSVL. The XhoI site will be regenerated
and the BglII and BamHI sites will be eliminated. The sequences of the coding regions
(i.e., between the XbaI and KpnI sites, c.f. Figure 6) of several constructs are determined
until one is found that encodes a protein having the desired amino acid sequence illustrated
in Figure 6. This is done to eliminate the possible errors that arise as the result of PCR
and other DNA manipulation and is a standard precaution to be certain that the desired
sequence is obtained. The expressed protein is expected to lack amino acid residues
MEMFQGLLLLLLLSMGGTWA that are part of the signal sequence found in the hCG
β-subunit and which are removed by the cell during protein synthesis. This vector is
expressed in COS-7 cells as described (64) and the protein released into the medium is
tested for its ability to inhibit the binding of radioiodinated hCG to monoclonal antibodies
or to antisera prepared against hCG. The protein made by the COS-7 cells will compete
with radioiodinated hCG for binding to one or more of the following antibodies: B101
(obtained from Columbia University), B105 (obtained from Columbia University), B107
(obtained from Columbia University), B109 (obtained from Columbia University), A201
(obtained from Columbia University), HCU061 (obtained from Hybritech), HCZ107
(obtained from Hybritech), or HC0514 (obtained from Hybritech), ZMCG18 (obtained
from Pierce), ZMCG13 (obtained from Pierce), or ZMCG7 (obtained from Pierce) or
518B7 (obtained from Dr. Janet Roser, University of California at Davis). The protein
released into the medium will compete with radiolabeled hCG for binding to receptors on
corpora lutea as described by Campbell, Dean-Emig, and Moyle (64). It would be
expected to stimulate testosterone formation in a Leydig cell assay performed similar to
that described by Moyle et al. (37) and to stimulate ovulation in female animals and to
stimulate testosterone formation in male mammals. This analog would also be expected to
be a good starting point for use in a contraceptive vaccine using the template approach
outlined in Example 11. This analog is shown in Table 1 as Analog #1 and contains a
linker sequence of GSGSGSGS. This linker can be modified by digesting the expression
vector with ApaI and Eco47III restriction endonucleases, discarding the short
piece, ligating a cassette of synthetic double stranded DNA with the desired amino acid
codons containing any number of glycine or serine codons or other amino acid codons into
the ApaI/Eco47III site by standard methods, sequencing the region between the
ApaI/Eco47III sites to confirm the desired mutations have been made, and expressing the
protein in COS-7 cells. This can be done to optimize the activity of the single chain
gonadotropin. The protein is expected to function as a monomer or to combine to form
active homodimers. In addition, several copies of the protein would be expected to
combine to form multimers.
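
The statement in this Example that the XhoI site is regenerated while the BglII and BamHI sites are eliminated follows from the cohesive ends these enzymes leave. The Python sketch below is an illustration, not part of the described procedure. It assumes the standard recognition sequences and cut positions (C^TCGAG for XhoI, A^GATCT for BglII, G^GATCC for BamHI); the function names are hypothetical.

```python
# Illustrative model of cohesive-end ligation for the enzymes named in Example 5.
# Each entry: (recognition sequence 5'->3', top-strand cut offset).
SITES = {
    "XhoI": ("CTCGAG", 1),   # C^TCGAG
    "BglII": ("AGATCT", 1),  # A^GATCT
    "BamHI": ("GGATCC", 1),  # G^GATCC
}

def overhang(enzyme: str) -> str:
    """Single-stranded 5' overhang left by a palindromic cutter."""
    site, cut = SITES[enzyme]
    return site[cut:len(site) - cut]

def ligated_junction(insert_enzyme: str, vector_enzyme: str) -> str:
    """Top-strand sequence formed when an insert end is ligated to a vector end."""
    if overhang(insert_enzyme) != overhang(vector_enzyme):
        raise ValueError("incompatible cohesive ends")
    insert_site, insert_cut = SITES[insert_enzyme]
    vector_site, vector_cut = SITES[vector_enzyme]
    return insert_site[:insert_cut] + vector_site[vector_cut:]

def sites_present(junction: str) -> list:
    """Names of the modeled enzymes whose site occurs in the junction."""
    return [name for name, (site, _) in SITES.items() if site in junction]

print(ligated_junction("XhoI", "XhoI"), sites_present(ligated_junction("XhoI", "XhoI")))
print(ligated_junction("BglII", "BamHI"), sites_present(ligated_junction("BglII", "BamHI")))
```

In this model the XhoI/XhoI junction reads CTCGAG and is cut again by XhoI, whereas the BglII/BamHI junction reads AGATCC, which neither BglII nor BamHI recognizes.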

Example 6

Preparation and use of Analog #2, a single chain gonadotropin with lutropin activity.
(See Figure 6)

The coding sequences for Analog #2 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared by PCR using primers #1
and #7 and the expression construct described in Example 5 and in Figure 5 as a template.
The sequence of primer #7 is 3'-
TGGTGGGGAACTGGACACTACTGGGCGCCCCTAGGCCATCG-5'. The final PCR
product is digested with restriction enzymes XhoI and BamHI and ligated with the large
fragment of DNA obtained by digesting the expression construct described in Example 12
with XhoI and BamHI. The sequences of the coding regions between the XhoI and
BamHI sites of several constructs are determined until one is found that encodes a protein
having the amino acid sequence described in Figure 7. This will ensure that
cloning artifacts are not present in the region that has been altered. The expressed protein
is expected to lack amino acid residues MEMFQGLLLLLLLSMGGTWA that are part
of the signal sequence found in the hCG β-subunit and which are removed by the cell during
protein synthesis. This vector is expressed in COS-7 cells and the protein released into the
medium is tested for its ability to inhibit the binding of radioiodinated hCG to monoclonal
antibodies or to antisera prepared against hCG. The protein made by the COS-7 cells will
compete with radioiodinated hCG for binding to one or more of the following antibodies:
B101 (obtained from Columbia University), B105 (obtained from Columbia University),
B107 (obtained from Columbia University), B109 (obtained from Columbia University),
A201 (obtained from Columbia University), HCU061 (obtained from Hybritech), or
HC0514 (obtained from Hybritech), ZMCG18 (obtained from Pierce), ZMCG13
(obtained from Pierce), or ZMCG7 (obtained from Pierce) or 518B7 (obtained from Dr.
Janet Roser, University of California at Davis). The protein released into the medium will
compete with radiolabeled hCG for binding to receptors on corpora lutea as described by
Campbell, Dean-Emig, and Moyle (64). It would be expected to stimulate testosterone
formation in a Leydig cell assay performed similar to that described by Moyle et al. (37)
and to stimulate ovulation in female animals and to stimulate testosterone formation in
male mammals. This analog would also be expected to be a good starting point for use in
a contraceptive vaccine using the template approach outlined in Example 11. This analog
is shown in Table 1 as Analog #2 and contains a linker sequence of GSGSGSGS. This
linker can be modified by digesting the expression vector with SstII and Eco47III
restriction endonucleases, discarding the short piece, ligating a cassette of
synthetic double stranded DNA with the desired amino acid codons containing any number
of glycine or serine codons or other amino acid codons into the SstII/Eco47III site by
standard methods, sequencing the region between the SstII/Eco47III sites to confirm the
desired mutations have been made, and expressing the protein in COS-7 cells. This can be
done to optimize the activity of the single chain gonadotropin. The protein is expected to
function as a monomer or to combine to form active homodimers. In addition, several
copies of the protein would be expected to combine to form multimers.
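
The coding portion of a replacement linker cassette can be composed codon by codon from the desired amino acids. The Python sketch below is a minimal illustration under stated assumptions: the codon chosen for each residue (GGT for glycine, AGC for serine) is a hypothetical choice, the function names are invented for the example, and the flanking ends needed to fit the restriction sites are omitted.

```python
# Illustrative back-translation of a Gly/Ser linker into a double stranded cassette.
# Codon choices are hypothetical; flanking cohesive ends are not modeled.
PREFERRED_CODON = {"G": "GGT", "S": "AGC"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def linker_top_strand(linker: str) -> str:
    """Concatenate one codon per residue, 5'->3'."""
    return "".join(PREFERRED_CODON[residue] for residue in linker)

def bottom_strand(top: str) -> str:
    """Reverse complement of the top strand, written 5'->3'."""
    return top.translate(COMPLEMENT)[::-1]

top = linker_top_strand("GSGSGSGS")
print(top)                 # coding strand of the linker
print(bottom_strand(top))  # complementary oligonucleotide
```

Lengthening or shortening the linker is then a matter of changing the residue string, for example `linker_top_strand("GSGS")` for a four-residue linker.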

Example 7

Preparation and use of Analog #3, a single chain gonadotropin with lutropin activity.
(See Figure 7)

The coding sequences for analog #3 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared in the fashion as described
for Analog #2 in Example 6 except that primers #1 and #7 are replaced with primers #8
and #9 and that the hLH β-subunit cDNA is used as a template in place of the hCG
β-subunit cDNA. The hLH β-subunit cDNA can be obtained by screening a human pituitary
library. The sequence of primer #8 is 5'-
ATGAAATCGACGGAATCAGACTCGAGCCAAGGAATGGAGATGCTCCAGGGGCTGCT-3'
and the sequence of primer #9 is 3'-
GTGGGGAACTGGACACTGGTGGGGGTTCCTAGGCCATCGCCTAGACCATCG-5'.
The final PCR product is digested with restriction enzymes XhoI and BamHI and
subcloned into the XhoI/BamHI sites of the expression vector created as described in
Example 12. The sequences of the coding regions between the XhoI and BamHI sites of
several constructs are determined until one is found that encodes a protein having the
amino acid sequence shown in Figure 8. The expressed protein is expected to lack amino
acid residues MEMLQGLLLLLLLSMGGAWA that are part of the signal sequence
found in the hLH β-subunit and which are removed by the cell during protein synthesis. This
vector is expressed in COS-7 cells and the protein released into the medium is tested for
its ability to inhibit the binding of radioiodinated hCG to monoclonal antibodies or to
antisera prepared against hCG. The protein made by the COS-7 cells will compete with
radioiodinated hCG for binding to one or more of the following antibodies: B101
(obtained from Columbia University), B105 (obtained from Columbia University), A201
(obtained from Columbia University), HCU061 (obtained from Hybritech), ZMCG7
(obtained from Pierce) or 518B7 (obtained from Dr. Janet Roser, University of California
at Davis). The protein released into the medium will compete with radiolabeled hCG for
binding to receptors on corpora lutea as described by Campbell, Dean-Emig, and Moyle
(64). It would be expected to stimulate testosterone formation in a Leydig cell assay
performed similar to that described by Moyle et al. (37) and to stimulate ovulation in
female animals and to stimulate testosterone formation in male mammals. This analog
would also be expected to be a good starting point for use in designing vaccines to
enhance or inhibit fertility using the template procedure outlined earlier. This analog is
shown in Table 1 as Analog #3 and contains a linker sequence of GSGSGSGS. This
linker can be modified by digesting the expression vector with BamHI and Eco47III
restriction endonucleases, discarding the short piece, ligating a cassette of
synthetic double stranded DNA with the desired amino acid codons containing any number
of glycine or serine codons or other amino acid codons into the BamHI/Eco47III site by
standard methods, sequencing the region between the BamHI/Eco47III sites to confirm the
desired mutations have been made, and expressing the protein in COS-7 cells. This can be
done to optimize the activity of the single chain gonadotropin. The protein is expected to
function as a monomer or to combine to form active homodimers. In addition, several
copies of the protein would be expected to combine to form multimers.

Example 8

Preparation and use of Analog #4, a single chain gonadotropin with follitropin activity.
(See Figure 8)

The coding sequences for analog #4 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared in the fashion as described
for Analog #2 in Example 13 except that primers #1 and #7 are replaced with primers #10
and #11 and that the hFSH β-subunit cDNA is used as a template in place of the hCG
β-subunit cDNA. The hFSH β-subunit cDNA can be obtained from a human pituitary gland
library. The sequence of primer #10 is 5'-
ATGAAATCGACGGAATCAGACTCGAGCCAAGGATGAAGACACTCCAGTTTTTCTTCC-3'
and the sequence of primer #11 is 3'-
GACGAGGAAACCACTTTACTTTCTTCCTAGGCCATCGCCTAGACCA-5'. The
final PCR product is digested with restriction enzymes XhoI and BamHI and subcloned
into the XhoI/BamHI sites of the expression vector created as described in Example 12.
The sequences of the coding regions between the XbaI and BamHI sites of several
constructs are determined until one is found that encodes a protein having the amino acid
sequence illustrated in Figure 9. The expressed protein is expected to lack amino acid
residues MKTLQFFFLFCCWKAICC that are part of the signal sequence found in the
hFSH β-subunit and which are removed by the cell during protein synthesis. The vector is
expressed in COS-7 cells and the protein made by the cells will compete with
radioiodinated hFSH for binding to one or more of the following antibodies: ZMFS1
(obtained from Pierce), A201 (obtained from Columbia University), HCU061 (obtained
from Hybritech), FSG761 (obtained from Hybritech), FSR093.3 (obtained from
Hybritech), FSH107 (obtained from Hybritech), FSB061 (obtained from Hybritech),
FSM210 (obtained from Hybritech), and FSM268 (obtained from Hybritech). The protein
released into the medium will compete with hFSH for binding to receptors on bovine
testes as described by Campbell, Dean-Emig, and Moyle (64). It would be expected to
stimulate estradiol formation in a granulosa cell assay performed similar to that described
by Skaf et al. (65) and to stimulate follicle development and spermatogenesis in female and
male mammals. This analog is also a useful starting compound to select for an
immunogen that elicits antibodies to FSH and is part of a contraceptive vaccine. This
analog is shown in Table 1 as Analog #4 and contains a linker sequence of GSGSGSGS.
This linker can be modified by digesting the expression vector with ApaI and Eco47III
restriction endonucleases, discarding the short piece, ligating a cassette of
synthetic double stranded DNA with the desired amino acid codons containing any number
of glycine or serine codons or other amino acid codons into the BamHI/Eco47III site by
standard methods, sequencing the region between the ApaI/Eco47III sites to confirm the
desired mutations have been made, and expressing the protein in COS-7 cells. This can be
done to optimize the activity of the single chain gonadotropin. The protein is expected to
function as a monomer or to combine to form active homodimers. In addition, several
copies of the protein would be expected to combine to form multimers.
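
Several primers in these Examples are written 3'-to-5', so the coding strand they specify is obtained by complementing each base in place rather than by reading the string directly. The Python sketch below illustrates this for primer #11 of this Example. The reading-frame offset of 1 is an assumption, chosen because that frame reads through without a stop codon and ends in the alternating Gly/Ser codons of the linker; the function names are hypothetical and the codon table is the standard genetic code.

```python
# Illustrative reading of a primer written 3'->5' (primer #11, Example 8).
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def coding_strand(primer_3_to_5: str) -> str:
    """Complement a primer written 3'->5' to give the 5'->3' coding strand."""
    return primer_3_to_5.translate(COMPLEMENT)

def translate(dna: str, frame: int = 0) -> str:
    """Translate complete codons of dna starting at the given offset."""
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE[codon] for codon in codons)

PRIMER_11 = "GACGAGGAAACCACTTTACTTTCTTCCTAGGCCATCGCCTAGACCA"  # as written, 3'->5'

coding = coding_strand(PRIMER_11)
print(coding)
print(translate(coding, frame=1))  # frame offset is an assumption
print("GGATCC" in coding)          # BamHI recognition sequence
```

Read this way, the primer specifies eight codons of subunit sequence followed by the first seven codons of the Gly/Ser linker, with a BamHI recognition sequence (GGATCC) formed by the first two linker codons.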

Example 9

Preparation and use of Analog #5, a single chain gonadotropin with FSH activity that is
structurally more similar to hCG than hFSH. (See Figure 9)

The coding sequences for analog #5 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared in the fashion as described
for Analog #2 in Example 6 except that primer #7 is replaced with primer #12. The
sequence of primer #12 is 3'-
CGACAGTCGACAGTTACACGTGAGACGCTGTCGCTGTCGTGACTAACATGACACGCTCCGGACCCCGGGTCGATGACGAGGAAACCACTTTACTTTCTTCCTAGGCCATCG-5'.
The final PCR product is digested with restriction enzymes XhoI and BamHI
and subcloned into the XhoI/BamHI sites of the expression vector created as described in
Example 12. The sequences of the coding regions between the XbaI and BamHI sites of
several constructs are determined until one is found that encodes a protein having the
amino acid sequence illustrated in Figure 10. The expressed protein is expected to lack
amino acid residues MEMLQGLLLLLLLSMGGAWA that are part of the signal
sequence found in the hCG β-subunit and which are removed by the cell during protein
synthesis. This vector is expressed in COS-7 cells and the protein released into the
medium is tested for its ability to inhibit the binding of radioiodinated hCG to monoclonal
antibodies or to antisera prepared against hCG. The protein made by the COS-7 cells will
compete with radioiodinated hCG for binding to one or more of the following antibodies:
B101 (obtained from Columbia University), B105 (obtained from Columbia University),
B107 (obtained from Columbia University), B109 (obtained from Columbia University),
A201 (obtained from Columbia University), HCU061 (obtained from Hybritech), or
HC0514 (obtained from Hybritech), ZMCG18 (obtained from Pierce), ZMCG13
(obtained from Pierce), or ZMCG7 (obtained from Pierce) or 518B7 (obtained from Dr.
Janet Roser, University of California at Davis). The protein released into the medium will
compete with hFSH for binding to receptors on bovine testes as described by Campbell,
Dean-Emig, and Moyle (64). It would be expected to stimulate estradiol formation in a
granulosa cell assay performed similar to that described by Skaf et al. (65) and to stimulate
follicle development and spermatogenesis in female and male mammals. This analog is
shown in Table 1 as Analog #5 and contains a linker sequence of GSGSGSGS. This
linker can be modified by digesting the expression vector with ApaI and Eco47III
restriction endonucleases, discarding the short piece, ligating a cassette of
synthetic double stranded DNA with the desired amino acid codons containing any number
of glycine or serine codons or other amino acid codons into the BamHI/Eco47III site by
standard methods, sequencing the region between the ApaI/Eco47III sites to confirm the
desired mutations have been made, and expressing the protein in COS-7 cells. This can be
done to optimize the activity of the single chain gonadotropin. The protein is expected to
function as a monomer or to combine to form active homodimers. In addition, several
copies of the protein would be expected to combine to form multimers.

Example 10 

Preparation and use of Analog #6, a single chain gonadotropin with FSH and LH activities
that is structurally more similar to hCG than hFSH. (See Figure 10)

The coding sequences for analog #6 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared in the fashion as described
for Analog #2 in Example 6 except that primer #7 is replaced with primer #13. The
sequence of primer #13 is 3'-
ACGGCGGCGTCGTGGTGACTGACGTGACACGCTCCGGACCCCGGGTCGATGACGAGGAAACCACTTTACTTTCTTCCTAGGCCATCG-5'.
The final PCR product is
digested with restriction enzymes XhoI and BamHI and subcloned into the XhoI/BamHI
sites of the expression vector created as described in Example 12. The sequences of the
coding regions between the XbaI and BamHI sites of several constructs are determined
until one is found that encodes a protein having the amino acid sequence illustrated in
Figure 11. The expressed protein is expected to lack amino acid residues
MEMLQGLLLLLLLSMGGAWA that are part of the signal sequence found in the hCG
β-subunit and which are removed by the cell during protein synthesis. This vector is
expressed in COS-7 cells and the protein released into the medium is tested for its ability
to inhibit the binding of radioiodinated hCG to monoclonal antibodies or to antisera
prepared against hCG. The protein made by the COS-7 cells will compete with
radioiodinated hCG for binding to one or more of the following antibodies: B101
(obtained from Columbia University), B105 (obtained from Columbia University), B107
(obtained from Columbia University), B109 (obtained from Columbia University), A201
(obtained from Columbia University), HCU061 (obtained from Hybritech), or HC0514
(obtained from Hybritech), ZMCG18 (obtained from Pierce), ZMCG13 (obtained from
Pierce), or ZMCG7 (obtained from Pierce) or 518B7 (obtained from Dr. Janet Roser,
University of California at Davis). The protein released into the medium will compete
with hFSH for binding to receptors on bovine testes as described by Campbell, Dean-Emig,
and Moyle (64). It would be expected to stimulate estradiol formation in a
granulosa cell assay performed similar to that described by Skaf et al. (65) and to stimulate
follicle development and spermatogenesis in female and male mammals. The protein
released into the medium will compete with radiolabeled hCG for binding to receptors on
corpora lutea as described by Campbell, Dean-Emig, and Moyle (64). It would be
expected to stimulate testosterone formation in a Leydig cell assay performed similar to
that described by Moyle et al. (37) and to stimulate ovulation in female animals and to
stimulate testosterone formation in male mammals. This analog is shown in Table 1 as
Analog #6 and contains a linker sequence of GSGSGSGS. This linker can be modified by
digesting the expression vector with ApaI and Eco47III restriction endonucleases,
discarding the short piece, ligating a cassette of synthetic double stranded DNA with the
desired amino acid codons containing any number of glycine or serine codons or other
amino acid codons into the BamHI/Eco47III site by standard methods, sequencing the
region between the ApaI/Eco47III sites to confirm the desired mutations have been made, and
expressing the protein in COS-7 cells. This can be done to optimize the activity of the
single chain gonadotropin. The protein is expected to function as a monomer or to
combine to form active homodimers. In addition, several copies of the protein would be
expected to combine to form multimers.

Example 11

Preparation and use of Analog #7, a single chain gonadotropin with FSH and LH activities
that is structurally more similar to hCG than hFSH.
The coding sequences for analog #7 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared in the fashion as described
for Analog #2 in Example 6 except that primer #7 is replaced with primer #14. The
sequence of primer #14 is 3'-
ACGGCGGCGTCGTGGTGACTGACGTGACACGCTCCGGACCCCGGGTCGATGACGAGGAAACCACTTCCTAGGCCATCG-5'.
The final PCR product is digested with
restriction enzymes XhoI and BamHI and subcloned into the XhoI/BamHI sites of the
expression vector created as described in Example 12. The sequences of the coding
regions between the XbaI and BamHI sites of several constructs are determined until one
is found that encodes a protein having the amino acid sequence illustrated in Figure 12.
The expressed protein is expected to lack amino acid residues
MEMLQGLLLLLLLSMGGAWA that are part of the signal sequence found in the hCG
β-subunit and which are removed by the cell during protein synthesis. This vector is
expressed in COS-7 cells and the protein released into the medium is tested for its ability
to inhibit the binding of radioiodinated hCG to monoclonal antibodies or to antisera
prepared against hCG. The protein made by the COS-7 cells will compete with
radioiodinated hCG for binding to one or more of the following antibodies: B101
(obtained from Columbia University), B105 (obtained from Columbia University), B107
(obtained from Columbia University), B109 (obtained from Columbia University), A201
(obtained from Columbia University), HCU061 (obtained from Hybritech), or HC0514
(obtained from Hybritech), ZMCG18 (obtained from Pierce), ZMCG13 (obtained from
Pierce), or ZMCG7 (obtained from Pierce) or 518B7 (obtained from Dr. Janet Roser,
University of California at Davis). The protein released into the medium will compete
with hFSH for binding to receptors on bovine testes as described by Campbell, Dean-Emig,
and Moyle (64). It would be expected to stimulate estradiol formation in a
granulosa cell assay performed similar to that described by Skaf et al. (65) and to stimulate
follicle development and spermatogenesis in female and male mammals. The protein
released into the medium will compete with radiolabeled hCG for binding to receptors on
corpora lutea as described by Campbell, Dean-Emig, and Moyle (64). It would be
expected to stimulate testosterone formation in a Leydig cell assay performed similar to
that described by Moyle et al. (37) and to stimulate ovulation in female animals and to
stimulate testosterone formation in male mammals. This analog is shown in Table 1 as
Analog #7 and contains a linker sequence of GSGSGSGS. This linker can be modified
by digesting the expression vector with ApaI and Eco47III restriction
endonucleases, discarding the short piece, ligating a cassette of synthetic double stranded DNA
with the desired amino acid codons containing any number of glycine or serine codons or
other amino acid codons into the BamHI/Eco47III site by standard methods, sequencing
the region between the ApaI/Eco47III sites to confirm the desired mutations have been made,
and expressing the protein in COS-7 cells. This can be done to optimize the activity of the
single chain gonadotropin. The protein is expected to function as a monomer or to
combine to form active homodimers. In addition, several copies of the protein would be
expected to combine to form multimers.

Example 12 

Preparation and use of Analog #8, a single chain gonadotropin with FSH and LH activities
that is structurally more similar to hCG than hFSH. (See Figure 12)

The coding sequences for analog #8 listed in Table 1 can be synthesized using the 
block ligation approach described (54) or they can be prepared in the fashion as described 
for Analog #2 in Example 6 except that primer #7 is replaced with primer #15. The 
2 0 sequence of primer #15 is 3'- 

ACGGCGGCGTCGTGGTGACTGACGTGACACGCTCCGGACCCCGGGTCGATGA 
CGCTACTGGGCGCCCCTAGGCCATCG-5'. The final PGR product is digested with 
restriction enzymes Xhol and BamHI and subcloned into the XhoI/BamHI sites of the 
expression vector created as described in Example 5. The sequences of the coding regions 

2 5 between the Xbal and BamHI sites of several constructs are determined until one is found 

that encodes a protein having the amino acid sequence illustrated in Figure 6. The 
expressed protein is expected to lack amino acid residues 

MEMLQGLLLLLLLSMGGAWA that are the part of the signal sequence found in hCG 
p-subunit and which are removed by the cell during protein synthesis. This vector is 

3 0 expressed in COS-7 cells and the protein released into the medium is tested for its ability 

to inhibit the binding of radioiodinated hCG to monoclonal antibodies or to antisera 
prepared against hCG. The protein made by the COS-7 cells will compete with 
radioiodinated hCG for binding to one or more of the following antibodies: BlOl 
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(obtained from Columbia University), B105 (obtained from Columbia University), B107 
(obtained from Columbia University), B109 (obtained from Columbia University), A201 
(obtained from Columbia University), HCU061 (obtained from Hybritech), or HC05 14 
(obtained from Hybritech), ZMCG18 (obtained from Pierce), ZMCG13 (obtained from 
5 Pierce), or ZMCG7 (obtained from Pierce) or 5 1 8B7 (obtained from Dr. Janet Roser, 
University of California at Davis). The protein released into the medium will compete 
with hFSH for binding to receptors on bovine testes as described by Campbell, Dean- 
Emig, and Moyle (64). It would be expected to stimulate estradiol formation in a 
granulosa cell assay performed similar to that described by Skaf et al (65) and to stimulate 

10 follicle development and spermatogenesis in female and male mammals. The protein 

released into the medium will compete with radiolabeled hCG for binding to receptors on 
corpora lutea as described by Campbell, Dean-Emig, and Moyle (64). It would be 
expected to stimulate testosterone formation in a Leydig cell assay performed similar to 
that described by Moyle et ah (37) and to stimulate ovulation in female animals and to 

15 stimulate testosterone formation in male mammals. This analog is shown in Table 1 as 

Analog #8 and contains a linker sequence of GSGSGSGS. This linker can be modified by 
digesting the expression vector with Apal and Eco47III endonuclease restriction enzymes, 
discarding the short piece, ligating a cassette of synthetic double stranded DNA with the 
desired amino acid codons containing any number of glycine or serine codons or other 

2 0 amino acid codons into the BamHI/Eco47in site by standard methods, sequencing the 
region between the ApaI/Eco47III to confirm the desired mutations have been made, and 
expressing the protein in COS-7 cells. This can be done to optimize the activity of the 
single chain gonadotropin. The protein is expected to function as a monomer or to 
combine to form active homodimers. In addition, several copies of the protein would be 

2 5 expected to combine to form multimers. 
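Primer #15 is printed in the 3'-to-5' direction, whereas most sequence tools expect 5'-to-3' input. The short sketch below, whose function names are illustrative rather than part of this specification, converts the printed primer to 5'-to-3' order and derives the 5'-to-3' sequence of the strand to which it would anneal. The primer string is copied as printed above.

```python
# Primer #15 exactly as printed in Example 12, written 3' -> 5'.
PRIMER_15_WRITTEN_3_TO_5 = (
    "ACGGCGGCGTCGTGGTGACTGACGTGACACGCTCCGGACCCCGGGTCGATGA"
    "CGCTACTGGGCGCCCCTAGGCCATCG"
)

COMPLEMENT = str.maketrans("ACGT", "TGCA")


def as_5_to_3(written_3_to_5: str) -> str:
    """Reverse a sequence printed 3'->5' so that it reads 5'->3'."""
    return written_3_to_5[::-1]


def annealing_site(written_3_to_5: str) -> str:
    """Return, read 5'->3', the strand that base-pairs with the primer.

    Strands pair antiparallel, so complementing the 3'->5' text base by base
    (without reversing it) already yields the partner strand in 5'->3' order.
    """
    return written_3_to_5.translate(COMPLEMENT)


if __name__ == "__main__":
    print("primer 5'->3':  ", as_5_to_3(PRIMER_15_WRITTEN_3_TO_5))
    print("partner 5'->3': ", annealing_site(PRIMER_15_WRITTEN_3_TO_5))
```

Read this way, the partner strand of the printed primer contains GGATCC, the BamHI recognition sequence, which is consistent with the BamHI digestion of the final PCR product described above.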

Example 13 

Preparation and use of Analog #9, a single chain gonadotropin with follitropin activity. 

(See Figure 13) 

The coding sequences for Analog #9 listed in Table 1 can be synthesized using the
block ligation approach described (54) or they can be prepared by digesting the construct
described in Example 8 used to express Analog 4 with the restriction enzymes ApaI and
BamHI. The small piece is replaced with a cassette of synthetic DNA to give the
sequence illustrated in Figure 13. The coding sequence between the ApaI and BamHI
sites of several constructs is determined until one is found that encodes a protein having
the amino acid sequence illustrated in Figure 13. The expressed protein is expected to
lack amino acid residues MKTLQFFFLFCCWKAICC that are part of the signal
sequence found in hFSH β-subunit and which are removed by the cell during protein
synthesis. The vector is expressed in COS-7 cells and the protein made by the cells will
compete with radioiodinated hFSH for binding to one or more of the following antibodies:
ZMFS1 (obtained from Pierce), A201 (obtained from Columbia University), HCU061
(obtained from Hybritech), FSG761 (obtained from Hybritech), FSR093.3 (obtained from
Hybritech), FSH107 (obtained from Hybritech), FSB061 (obtained from Hybritech),
FSM210 (obtained from Hybritech), and FSM268 (obtained from Hybritech). The protein
released into the medium will compete with hFSH for binding to receptors on bovine
testes as described by Campbell, Dean-Emig, and Moyle (64). It would be expected to
stimulate estradiol formation in a granulosa cell assay performed similarly to that
described by Skaf et al. (65) and to stimulate follicle development and spermatogenesis
in female and male mammals, respectively. This analog is also a useful starting compound
to select for an immunogen that elicits antibodies to FSH and is part of a contraceptive
vaccine. This analog is shown in Table 1 as Analog #9 and contains a linker sequence of
GSGSGSGS. This linker can be modified by digesting the expression vector with ApaI and
Eco47III restriction endonucleases, discarding the short piece, ligating a cassette of
synthetic double-stranded DNA with the desired amino acid codons (any number of glycine
or serine codons or other amino acid codons) into the BamHI/Eco47III site by standard
methods, sequencing the region between the ApaI and Eco47III sites to confirm that the
desired mutations have been made, and expressing the protein in COS-7 cells. This can be
done to optimize the activity of the single chain gonadotropin. The protein is expected to
function as a monomer or to combine to form active homodimers. In addition, several
copies of the protein would be expected to combine to form multimers.

Example 14 

Preparation and use of Analog #10, a single chain gonadotropin with follitropin activity.
(See Figure 14)

The coding sequences for Analog #10 listed in Table 1 can be synthesized using
the block ligation approach described (54) or they can be prepared by digesting the
construct described in Example 8 used to express Analog 4 with the restriction enzymes
ApaI and BamHI. The small piece is replaced with a cassette of synthetic DNA to give
the sequence illustrated in Figure 14. The coding sequence between the ApaI and BamHI
sites of several constructs is determined until one is found that encodes a protein having
the amino acid sequence illustrated in Figure 14. The expressed protein is expected to
lack amino acid residues MKTLQFFFLFCCWKAICC that are part of the signal
sequence found in hFSH β-subunit and which are removed by the cell during protein
synthesis. The vector is expressed in COS-7 cells and the protein made by the cells will
compete with radioiodinated hFSH for binding to one or more of the following antibodies:
ZMFS1 (obtained from Pierce), A201 (obtained from Columbia University), HCU061
(obtained from Hybritech), FSG761 (obtained from Hybritech), FSR093.3 (obtained from
Hybritech), FSH107 (obtained from Hybritech), FSB061 (obtained from Hybritech),
FSM210 (obtained from Hybritech), and FSM268 (obtained from Hybritech). The protein
released into the medium will compete with hFSH for binding to receptors on bovine
testes as described by Campbell, Dean-Emig, and Moyle (64). It would be expected to
stimulate estradiol formation in a granulosa cell assay performed similarly to that
described by Skaf et al. (65) and to stimulate follicle development and spermatogenesis
in female and male mammals, respectively. This analog is also a useful starting compound
to select for an immunogen that elicits antibodies to FSH and is part of a contraceptive
vaccine. This analog is shown in Table 1 as Analog #10 and contains a linker sequence of
GSGSGSGS. This linker can be modified by digesting the expression vector with ApaI and
Eco47III restriction endonucleases, discarding the short piece, ligating a cassette of
synthetic double-stranded DNA with the desired amino acid codons (any number of glycine
or serine codons or other amino acid codons) into the BamHI/Eco47III site by standard
methods, sequencing the region between the ApaI and Eco47III sites to confirm that the
desired mutations have been made, and expressing the protein in COS-7 cells. This can be
done to optimize the activity of the single chain gonadotropin. The protein is expected to
function as a monomer or to combine to form active homodimers. In addition, several
copies of the protein would be expected to combine to form multimers.

Example 15

Preparation of an α-subunit analog lacking glycosylation sites. (See Figure 15)
Analogs 1-10 are expected to contain 4 asparagine-linked oligosaccharides since
they contain 4 sets of codons for the sequence Asparagine-X-Threonine/Serine where X is
any amino acid except proline. Removal of the asparagine-linked oligosaccharides,
particularly those of the α-subunit, has been shown to reduce hormone efficacy. The
asparagine-linked glycosylation signals can be removed from the α-subunit portion of the
single chain gonadotropins using PCR as described here. PCR primer 16 having the
sequence: 5'-
TGGTTGTGTAGAGCATATGGGAGTCGAGTAAGGTGGAAGAAGAGGATGTTGGT
GGAAAAGGAAGTGAGGT-3' and PCR primer 17 having the sequence: 3'-
CAAAGTTTGAGGTGGTTGTGTGGCGCAGGGTGAGGTGATGAAGAATAATAGTG
TTTAGAATTGCATGGGGATG-5' are used in a PCR reaction with the vector that is
capable of directing the expression of Analog 1 and that was described in Example 5 and
Figure 5. After 25 cycles in the conditions described in Example 5, the PCR product and
the expression vector are digested with XbaI and KpnI. The small fragment produced by
digestion of the vector is discarded and the digested PCR product is ligated into the vector
in its place. This produces an expression vector that encodes Analog 11, an analog that
contains only 2 Asn-linked glycosylation signals but that is expected to retain its affinity
for antibodies and antisera that bind to hCG. It is also expected to retain its affinity for
LH receptors as shown by its ability to compete with hCG for binding to membranes from
rat corpora lutea. However, it is expected to have a reduced ability to induce signal
transduction, especially when its ability to elicit cyclic AMP accumulation is tested (37).
It is possible to create similar derivatives of Analogs 2-10 in which the oligosaccharides
are removed from the portion of the protein derived from the α-subunit by digesting each
of the expression vectors with BamHI and KpnI, discarding the smaller piece, and ligating
in the small BamHI/KpnI fragment obtained by digestion of the Analog 11 vector. Thus,
Analog 2 would become Analog 12, Analog 3 would become Analog 13, Analog 4 would become
Analog 14, Analog 5 would become Analog 15, Analog 6 would become Analog 16,
Analog 7 would become Analog 17, Analog 8 would become Analog 18, Analog 9 would
become Analog 19, and Analog 10 would become Analog 20. Note that it would also be
possible to remove only one of the two glycosylation signals on the portion of the single
chain gonadotropins derived from the α-subunit simply by changing the sequences of
primers 16 and 17 during their synthesis and following the protocol outlined here. Each of
these analogs would exhibit the same antibody and receptor binding as their precursors.
They would have reduced efficacy and, as a consequence, they would inhibit signal
transduction. Analogs 11, 12, and 13 would reduce the activity of LH and would
stimulate fertility when given in the early part of the follicular phase of the menstrual cycle.
They would reduce the activity of hCG and would prevent fertility when administered near
the time of expected menses.
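The glycosylation signal used throughout these examples (Asparagine-X-Threonine/Serine, where X is any amino acid except proline) can be located in a protein sequence with a short script. The sketch below is illustrative only; the function name is an assumption, and the sequence is the human α-subunit (residues 1-92) as listed in the definitions for Table 1.

```python
import re

# Human alpha-subunit residues 1-92, as listed in the definitions for Table 1.
HUMAN_ALPHA = (
    "APDVQDCPECTLQENPFFSQPGAPILQCMGCCFSRAYPTPLRSKKTMLVQK"
    "NVTSESTCCVAKSYNRVTVMGGFKVENHTACHCSTCYYHKS"
)

# Asn, then any residue except Pro, then Ser or Thr. The lookahead keeps
# matches one residue wide so that overlapping signals are all reported.
SEQUON = re.compile(r"N(?=[^P][ST])")


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of asparagines in Asn-X-Ser/Thr signals."""
    return [match.start() + 1 for match in SEQUON.finditer(protein)]


if __name__ == "__main__":
    print(find_sequons(HUMAN_ALPHA))  # [52, 78]
```

The two positions reported for the α-subunit, 52 and 78, are the asparagines targeted by the N52X and N78X substitutions in Table 1. Asparagine 15 is followed by proline and asparagine 66 is not followed by serine or threonine at the third position, so neither is reported.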

Example 16 

Preparation of Analog 1a lacking asparagine-linked oligosaccharides.
(See Figures 16 and 17)
The efficacy of gonadotropins is proportional to their content of carbohydrates and
while Analogs 11, 12, 13, 14, 15, 16, 17, 18, 19, and 20 have lower efficacy, it is possible
to reduce their efficacy further by eliminating all oligosaccharide chains. The
asparagine-linked oligosaccharide chains can be eliminated from Analog 11 by PCR SOEing (63)
using primers 1 and 18 in one reaction and primers 2 and 19 in a second reaction. The
expression vector for Analog 11 serves as a template in both reactions. The sequence of
primer #18 is 5'-
CGGGGTAGGTTCGGTGGGACCGACACCTCTTCCTCCCGACGGGG-3' and the
sequence of primer #19 is 3'-
GTGGAGAAGGAGGGCTGCCCCGTGTGCATCACCGTCAACACCACCATC-5'.
After 25 temperature cycles at 94°C (30 sec), 55°C (60 sec), and 72°C (60 sec), 1 μl of
each PCR reaction is mixed with primer #5 and additional primer #2, new buffer, enzyme,
and deoxynucleotide triphosphates. The reaction product after 25 additional cycles is cut
with XhoI and BamHI and substituted for the original DNA found between the
XhoI/BamHI sites of the vector encoding Analog 11. This is accomplished by digesting
the vector with XhoI and BamHI, discarding the small fragment and then ligating the large
fragment with the XhoI/BamHI digested PCR product. Several clones are subjected to
DNA sequencing until the one encoding the analog outlined in Figure 18, termed Analog
1a, is obtained. When this is expressed in COS-7 cells, the protein that is made will be
recognized by the same antibodies and antisera as Analog 1. Analog 1a will also bind to
lutropin receptors but will have reduced efficacy relative to hCG. Thus, it will be useful
for reducing the function of LH or hCG. When administered early in the follicular phase
of the menstrual cycle, Analog 1a will reduce androgen synthesis. As a consequence,
estradiol synthesis will decline, FSH levels will rise and fertility will be stimulated. Analog
1a will also be useful for inhibiting premature luteinization of the follicle. When
administered in the luteal phase at about the time of expected menses, the analog will
block the actions of hCG and serve as a menses inducer and an inhibitor of fertility.
Analog 1a will also serve as a good starting compound to design vaccines using the
template strategy described earlier.

Example 17 

Preparation of other gonadotropins lacking asparagine-linked oligosaccharides

The coding vectors for Analogs 2a, 5a, 6a, 7a, and 8a are readily prepared from
the coding vectors for Analog 1a and Analogs 12, 15, 16, 17, and 18. The Analog 1a vector
is digested with KpnI and MstII and the small fragment discarded. The large fragment is
ligated separately to the small fragment prepared by KpnI-MstII digestion of the coding
vectors for Analogs 12, 15, 16, 17, and 18. Analogs 2a, 5a, 6a, 7a, and 8a will bind the
same antibodies and receptors as Analogs 2, 5, 6, 7, and 8, respectively. However, their
abilities to elicit signal transduction will be reduced. Consequently, they will serve as
inhibitors. Analog 2a will be effective primarily in blocking binding of hormones to LH
receptors. Depending on the time that it is administered, Analog 2a will elicit fertility
(i.e., when given early in the menstrual cycle) or will inhibit fertility (i.e., when given
near the time of implantation or expected menses). In this regard Analogs 1a and 2a will
have similar activities. Analog 5a will be effective primarily in blocking binding of
hormones to FSH receptors. Analog 5a will be useful for suppressing hyperovarian
stimulation. Analogs 6a, 7a, and 8a will be inhibitors of binding to LH and FSH receptors.
These will be useful for suppressing hyperovarian stimulation and for blocking premature
luteinization.

The coding vectors for Analogs 3a and 4a can be made by SOEing PCR (63) in
which Analogs 13 and 14 serve as templates. The strategy for design of the primers is
similar to that described for the preparation of primers used to modify the expression
vector for Analog 1a. When Analogs 3a and 4a are expressed in COS-7 cells, the proteins
that are made will be recognized by the same antibodies and antisera as Analogs 3 and 4,
respectively. Analog 3a will be useful for inhibiting the activity of hormones that bind to
LH receptors. As such it will stimulate fertility when given early in the follicular phase.
Analog 4a will be useful for inhibiting the activity of FSH. Analog 3a will be useful as a
starting molecule for designing the vaccine to be used to increase fertility using the
template strategy and antibodies that are able to partially neutralize the activity of LH.
Analog 3a will also be useful as a starting molecule for designing the vaccine to prevent
fertility using the template strategy and antibodies that are able to neutralize LH activity.
Analog 4a will also be useful as a starting molecule for designing the anti-FSH vaccine
described earlier using the template strategy.

The coding vectors for Analogs 9a and 10a can be prepared from the coding
vector for Analog 4a. The coding vector for Analog 4a is digested with BalI and KpnI
and the small fragment discarded. The small BalI-KpnI fragments from the coding vectors
for Analogs 19 and 20 are ligated separately with the large Analog 4a fragment to produce
coding vectors for Analogs 9a and 10a. When produced in COS-7 cells, Analogs 9a and
10a will have antibody and FSH receptor binding specificities similar to those of Analogs
9 and 10. Analogs 9a and 10a will have lower efficacy and will inhibit the activity of FSH.
Thus, they will be useful for reducing ovarian hyperstimulation. They will also be useful
starting vectors for the design of anti-FSH vaccines using the template strategy.

Example 18 

Typical procedure for introducing a glycosylation site in a gonadotropin.
Due to the positive influence of oligosaccharide residues on the stability of
hormones in circulation, it is often useful to add extra oligosaccharide chains to the
proteins. Addition of oligosaccharides can also be used to prevent unwanted antibody or
receptor interactions. Surfaces of the protein that do not interact with receptors are useful
places to add oligosaccharide chains that are to be used to stimulate hormone function.
This can have a valuable effect in modulating the activities of single chain glycoprotein
hormones or in modulating the activities of the α,β-heterodimeric glycoprotein hormones.
For example, addition of a glycosylation signal to FSH β-subunit at residues 71-73 to
cause the creation of an asparagine-linked oligosaccharide at residue 71 will lead to a
hormone that has higher activity. Conversely, addition of a glycosylation residue in this
region of the protein after the other glycosylations have been removed will enhance its
inhibitory activity. Methods for performing the mutagenesis are standard in the art and
range from total synthesis of the coding sequences by block ligation of synthetic
oligonucleotides (54) to SOEing PCR (63). Several examples of mutagenesis by SOEing
PCR have already been given.
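The effect of adding a glycosylation signal at FSH β-subunit residues 71-73 can be checked at the sequence level before any mutagenesis is planned. The sketch below applies the D71N and L73T substitutions listed in Table 1 to the hFSH β-subunit sequence (residues 1-111, from the definitions for Table 1) and lists the Asn-X-Ser/Thr signals before and after. The helper names are illustrative assumptions, not part of this specification.

```python
import re

# hFSH beta-subunit residues 1-111, from the definitions for Table 1.
HFSH_BETA = (
    "NSCELTNITIAVEKEGCGFCITINTTWCAGYCYTRDLVYKDPARPKIQKTC"
    "TFKELVYETVRVPGCAHHADSLYTYPVATQCHCGKCDSDSTDCTVRGLGPSYCS"
    "FGEMKE"
)


def substitute(protein: str, change: str) -> str:
    """Apply a substitution such as 'D71N' (1-based residue numbering)."""
    old, position, new = change[0], int(change[1:-1]), change[-1]
    if protein[position - 1] != old:
        raise ValueError(f"residue {position} is {protein[position - 1]}, not {old}")
    return protein[:position - 1] + new + protein[position:]


def find_sequons(protein: str) -> list[int]:
    """Return 1-based positions of Asn in Asn-X-Ser/Thr, where X is not Pro."""
    return [m.start() + 1 for m in re.finditer(r"N(?=[^P][ST])", protein)]


mutant = HFSH_BETA
for change in ("D71N", "L73T"):
    mutant = substitute(mutant, change)

if __name__ == "__main__":
    print("before:", find_sequons(HFSH_BETA))  # [7, 24]
    print("after: ", find_sequons(mutant))     # [7, 24, 71]
```

Residues 71-73 change from Asp-Ser-Leu to Asn-Ser-Thr, which creates the new signal at residue 71 while leaving the native signals at residues 7 and 24 untouched.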

Example 19 

Use of sequences other than those derived from human subunits.
Analogs 1-20, Analogs 1b-10b and, in particular, Analogs 1a-10a will serve as
useful starting compounds for template directed vaccine design. For development of
hormone-specific vaccines for use in humans, it is useful to make analogs similar to those
listed in Table 1 with a nonhuman α-subunit in place of the human α-subunit. This is
because a nonhuman α-subunit, such as the bovine α-subunit, renders the proteins more
dissimilar to the human hormones than the analogs listed in Table 1. The approach to
designing single chain glycoprotein hormones is similar to that listed in Examples 12-21
except that the coding sequences for the nonhuman α-subunits are substituted for the
human α-subunit sequences illustrated. Similarly, the glycosylation signals can be removed
by altering the codons for asparagine or serine or threonine or by inserting a proline
between asparagine and the serine or threonine.

In addition, when using the template strategy to design immunogens it is often
desirable to start with a nonhuman molecule that has little, if any, affinity for the
templates used in positive selection and to introduce residues that will result in
selection. These analogs can be prepared by substituting the FSH, LH, or TSH β-subunit
sequences from nonhuman sources in place of the human FSH, LH, and hCG sequences
illustrated in Examples 5-18 and Table 1.

Table 1

Structures of Single Chain Gonadotropins

Analog  Composition

1       n-hCGβ(1-145)-Linker-humanα(1-92)-c
2       n-hCGβ(1-114)-Linker-humanα(1-92)-c
3       n-hLHβ(1-114)-Linker-humanα(1-92)-c
4       n-hFSHβ(1-111)-Linker-humanα(1-92)-c
5       n-hCGβ(1-93)-hFSHβ(88-111)-Linker-humanα(1-92)-c
6       n-hCGβ(1-100)-hFSHβ(95-111)-Linker-humanα(1-92)-c
7       n-hCGβ(1-100)-hFSHβ(95-108)-Linker-humanα(1-92)-c
8       n-hCGβ(1-100)-hFSHβ(95-103)-DDPR-Linker-humanα(1-92)-c
9       n-hFSHβ(1-108)-Linker-humanα(1-92)-c
10      n-hFSHβ(1-104)-Linker-humanα(1-92)-c

1a      n-hCGβ(1-145)[N13X,N30X]-Linker-humanα(1-92)[N52X,N78X]-c
2a      n-hCGβ(1-114)[N13X,N30X]-Linker-humanα(1-92)[N52X,N78X]-c
3a      n-hLHβ(1-114)[N30X]-Linker-humanα(1-92)[N52X,N78X]-c
4a      n-hFSHβ(1-111)[N7X,N24X]-Linker-humanα(1-92)[N52X,N78X]-c
5a      n-hCGβ(1-93)[N13X,N30X]-hFSHβ(88-111)-Linker-humanα(1-92)[N52X,N78X]-c
6a      n-hCGβ(1-100)[N13X,N30X]-hFSHβ(95-111)-Linker-humanα(1-92)[N52X,N78X]-c
7a      n-hCGβ(1-100)[N13X,N30X]-hFSHβ(95-108)-Linker-humanα(1-92)[N52X,N78X]-c
8a      n-hCGβ(1-100)[N13X,N30X]-hFSHβ(95-103)-DDPR-Linker-humanα(1-92)[N52X,N78X]-c
9a      n-hFSHβ(1-108)-Linker-humanα(1-92)[N52X,N78X]-c
10a     n-hFSHβ(1-104)[N7X,N24X]-Linker-humanα(1-92)-c

1b      n-hCGβ(1-145)[N13X,N30X,P78X,V79T]-Linker-humanα(1-92)[N52X,N78X]-c
2b      n-hCGβ(1-114)[N13X,N30X,P78X,V79T]-Linker-humanα(1-92)[N52X,N78X]-c
3b      n-hLHβ(1-114)[N30X,P78X,V79T]-Linker-humanα(1-92)[N52X,N78X]-c
4b      n-hFSHβ(1-111)[N7X,N24X,D71N,L73T]-Linker-humanα(1-92)[N52X,N78X]-c
5b      n-hCGβ(1-93)[N13X,N30X,P78X,V79T]-hFSHβ(88-111)-Linker-humanα(1-92)[N52X,N78X]-c
6b      n-hCGβ(1-100)[N13X,N30X,P78X,V79T]-hFSHβ(95-111)-Linker-humanα(1-92)[N52X,N78X]-c
7b      n-hCGβ(1-100)[N13X,N30X,P78X,V79T]-hFSHβ(95-108)-Linker-humanα(1-92)[N52X,N78X]-c
8b      n-hCGβ(1-100)[N13X,N30X,P78X,V79T]-hFSHβ(95-103)-DDPR-Linker-humanα(1-92)[N52X,N78X]-c
9b      n-hFSHβ(1-108)[N7X,N24X,D71N,L73T]-Linker-humanα(1-92)[N52X,N78X]-c
10b     n-hFSHβ(1-104)[N7X,N24X,D71N,L73T]-Linker-humanα(1-92)[N52X,N78X]-c

Definitions of the letters and sequences in Table 1

"n-" refers to the N-terminus of the protein.
"-c" refers to the C-terminus of the protein.

"hCGβ(1-145)" refers to the hCG β-subunit amino acid sequence residues 1-145:
SKEPLRPRCRPINATLAVEKEGCPVCITVNTTICAGYCPTMTRVLQGVLPA
LPQVVCNYRDVRFESIRLPGCPRGVNPVVSYAVALSCQCALCRRSTTDCGGPKD
HPLTCDDPRFQDSSSSKAPPPSLPSPSRLPGPSDTPILPQ

"hCGβ(1-114)" refers to the hCG β-subunit amino acid sequence residues 1-114:
SKEPLRPRCRPINATLAVEKEGCPVCITVNTTICAGYCPTMTRVLQGVLPA
LPQVVCNYRDVRFESIRLPGCPRGVNPVVSYAVALSCQCALCRRSTTDCGGPKD
HPLTCDDPR

"hCGβ(1-93)" refers to the hCG β-subunit amino acid sequence residues 1-93:
SKEPLRPRCRPINATLAVEKEGCPVCITVNTTICAGYCPTMTRVLQGVLPA
LPQVVCNYRDVRFESIRLPGCPRGVNPVVSYAVALSCQCALC

"hLHβ(1-114)" refers to the hLH β-subunit amino acid sequence residues 1-114:
SREPLRPWCHPINAILAVEKEGCPVCITVNTTICAGYCPTMMRVLQAVLPP
LPQVVCTYRDVRFESIRLPGCPRGVDPVVSFPVALSCRCGPCRRSTSDCGGPKDH
PLTCDHPQ

"hFSHβ(1-111)" refers to the hFSH β-subunit amino acid sequence residues 1-111:
NSCELTNITIAVEKEGCGFCITINTTWCAGYCYTRDLVYKDPARPKIQKTC
TFKELVYETVRVPGCAHHADSLYTYPVATQCHCGKCDSDSTDCTVRGLGPSYCS
FGEMKE

"hFSHβ(1-108)" refers to the hFSH β-subunit amino acid sequence residues 1-108:
NSCELTNITIAVEKEGCGFCITINTTWCAGYCYTRDLVYKDPARPKIQKTC
TFKELVYETVRVPGCAHHADSLYTYPVATQCHCGKCDSDSTDCTVRGLGPSYCS
FGE

"hFSHβ(1-104)" refers to the hFSH β-subunit amino acid sequence residues 1-104:
NSCELTNITIAVEKEGCGFCITINTTWCAGYCYTRDLVYKDPARPKIQKTC
TFKELVYETVRVPGCAHHADSLYTYPVATQCHCGKCDSDSTDCTVRGLGPSYC

"hFSHβ(88-111)" refers to the hFSH β-subunit amino acid sequence residues 88-111:
DSDSTDCTVRGLGPSYCSFGEMKE

"hFSHβ(95-111)" refers to the hFSH β-subunit amino acid sequence residues 95-111:
TVRGLGPSYCSFGEMKE

"hFSHβ(95-108)" refers to the hFSH β-subunit amino acid sequence residues 95-108:
TVRGLGPSYCSFGE

"hFSHβ(95-103)" refers to the hFSH β-subunit amino acid sequence residues 95-103:
TVRGLGPSY

"N13X" refers to the substitution of glutamine or other amino acid for hCG
β-subunit residue asparagine 13 and analogs

"N30X" refers to the substitution of glutamine or other amino acid for hCG or
hLH β-subunit residue asparagine 30 and analogs

"N52X" refers to the substitution of glutamine or other amino acid for human
α-subunit residue asparagine 52 and analogs

"N78X" refers to the substitution of glutamine or other amino acid for human
α-subunit residue asparagine 78 and analogs

"P78X" refers to the substitution of any amino acid except proline for proline 78 in
the β-subunits of hCG or hLH and analogs

"V79T" refers to the substitution of threonine or serine for valine 79 in hCG or
hLH β-subunits and analogs

"D71N" refers to the substitution of asparagine for aspartic acid 71 in hFSH
β-subunits and analogs

"L73T" refers to the substitution of threonine or serine for leucine 73 in hFSH
β-subunits and analogs

"humanα(1-92)" refers to the human α-subunit sequence residues 1-92:
APDVQDCPECTLQENPFFSQPGAPILQCMGCCFSRAYPTPLRSKKTMLVQK
NVTSESTCCVAKSYNRVTVMGGFKVENHTACHCSTCYYHKS

"Linker" refers to a sequence containing repeating glycine and serine amino acids
such as GS, GSGS, GSGSGS, GSGSGSGS, GSGSGSGSGS or any other sequence of
amino acids that permits the β- and α-subunit sequences of the single chain gonadotropin
to form a complex in which the α- and β-subunit portions combine with the β- and
α-subunit portions of the same or other molecule.

"DDPR" refers to the amino acid sequence Aspartic acid-Aspartic acid-Proline-
Arginine.
Notes for Table 1:

1. The order of the components from left to right in the table is the order in
which the components occur in the protein from the amino-terminus to the
carboxy-terminus.

2. Due to the high conservation of sequence in all vertebrate gonadotropins
that can be seen from the alignment of their cysteine residues, single chain gonadotropins
can be prepared by substitution of any homologous residues for the corresponding
portions of the hCG, hLH, and hFSH β-subunits.

3. The sequence of the other vertebrate gonadotropin α-subunits can be
substituted for humanα(1-92). This includes but is not limited to bovine α-subunit
residues 1-96.

4. As shown, the order of the components has the sequences derived from the
β-subunit amino-terminal of the sequences derived from the α-subunit. The order of the
components in the table can be reversed such that the α-subunit sequences are
amino-terminal of the β-subunit sequences.

5. The amino acid sequences are shown in the standard single letter code
except as noted.

6. Coding sequences for all these analogs can be made by standard
recombinant DNA methods that are well known in the art. One procedure for making
these is that provided by Campbell et al. (54). They can be expressed in eukaryotic cells
by methods well known in the art using vectors that have been designed for eukaryotic
expression and that are available from InVitrogen, San Diego, CA. Those that do not
contain oligosaccharide chains can also be made in E. coli by methods well known in the
art using vectors such as the pET vectors that can be obtained from Novagen.

7. The glycosylation sites at hCG β-subunit asparagines 13 and/or 30 can be
destroyed by substitution of the asparagine as illustrated and/or by substitution of residues
14 and/or 31 with a proline and/or by substitution of residues 15 and/or 32 with any
amino acid other than serine or threonine.

8. The glycosylation site at hLH β-subunit asparagine 30 can be destroyed by
substitution of the asparagine as illustrated and/or by substitution of residue 31 with a
proline and/or by substitution of residue 32 with any amino acid other than serine or
threonine.

9. The glycosylation sites at human α-subunit asparagines 52 and/or 78 can be
destroyed by substitution of the asparagine as illustrated and/or by substitution of residues
53 and/or 79 with a proline and/or by substitution of residues 54 and/or 80 with any
amino acid other than serine or threonine.

10. The glycosylation sites at nonhuman α-subunit asparagines 56 and/or 82
can be destroyed by substitution of the asparagine with any other amino acid and/or by
substitution of residues 57 and/or 83 with a proline and/or by substitution of residues 58
and/or 84 with any amino acid other than serine or threonine.



Table 2

Properties and uses of the analogs illustrated in Table 1

Analog  Activity              Use

1       LH                    Induce ovulation; Increase male fertility
2       LH                    Induce ovulation; Increase male fertility
3       LH                    Induce ovulation; Increase male fertility
4       FSH                   Induce follicle development; Increase male fertility
5       FSH                   Induce follicle development; Increase male fertility
6       FSH and LH            Induce follicle development; Increase male fertility
7       FSH and LH            Induce follicle development; Increase male fertility
8       FSH and LH            Induce follicle development; Increase male fertility
9       FSH                   Induce follicle development; Increase male fertility
10      FSH                   Induce follicle development; Increase male fertility
1a      Anti-LH               *Facilitate ovulation; Terminate pregnancy; Reduce androgen secretion
2a      Anti-LH               *Facilitate ovulation; Terminate pregnancy; Reduce androgen secretion
3a      Anti-LH               *Facilitate ovulation; Terminate pregnancy; Reduce androgen secretion
4a      Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
5a      Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
6a      Anti-FSH and Anti-LH  Treat ovarian hyperstimulation; Reduce spermatogenesis
7a      Anti-FSH and Anti-LH  Treat ovarian hyperstimulation; Reduce spermatogenesis
8a      Anti-FSH and Anti-LH  Treat ovarian hyperstimulation; Reduce spermatogenesis
9a      Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
10a     Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
1b      Anti-LH               *Facilitate ovulation; Terminate pregnancy; Reduce androgen secretion
2b      Anti-LH               *Facilitate ovulation; Terminate pregnancy; Reduce androgen secretion
3b      Anti-LH               *Facilitate ovulation; Terminate pregnancy; Reduce androgen secretion
4b      Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
5b      Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
6b      Anti-FSH and Anti-LH  Treat ovarian hyperstimulation; Reduce spermatogenesis
7b      Anti-FSH and Anti-LH  Treat ovarian hyperstimulation; Reduce spermatogenesis
8b      Anti-FSH and Anti-LH  Treat ovarian hyperstimulation; Reduce spermatogenesis
9b      Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis
10b     Anti-FSH              Treat ovarian hyperstimulation; Reduce spermatogenesis

The compounds of the present invention can be administered to mammals, e.g.,
animals or humans, in amounts effective to provide the desired therapeutic effect. Since
the activity of the compounds and the degree of the desired therapeutic effect vary, the
dosage level of the compound employed will also vary. The actual dosage administered
will also be determined by such generally recognized factors as the body weight of the
patient and the individual hypersensitiveness of the particular patient.

Throughout this application, various publications have been referenced. The
disclosures in these publications are incorporated herein by reference in order to more
fully describe the state of the art.

Claims

1. A DNA or RNA molecule which comprises a nucleotide sequence that
encodes a single-chain protein which is an agonist or antagonist of a hormone selected from
the group consisting of luteinizing hormone (LH), follicle stimulating hormone (FSH),
thyroid stimulating hormone (TSH) and chorionic gonadotropin (CG), which single-chain
protein has an amino acid sequence of the formula

β-(linker)n-α or

α-(linker)n-β

wherein β is the β subunit of LH, FSH, TSH or CG or a variant thereof;

"linker" refers to a peptide linker containing 1-100 amino acids;
n is 0 or 1, and

α represents the amino acid sequence of the α subunit common to LH, FSH, TSH
and CG or a variant thereof.

2. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence 
encodes a protein wherein n is zero. 

3. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence
encodes a protein wherein n is 1 and the linker is a complete CTP unit consisting of amino
acid residues 112-118 to 145 of human chorionic gonadotropin β subunit.

4. The DNA or RNA molecule of claim 4 wherein the nucleotide sequence 
encodes a protein wherein n is 1 and the linker is a peptide containing 1-16 amino acids. 

5. The DNA or RNA molecule of claim 6 wherein the nucleotide sequence
encodes a protein wherein the linker is a glycine/serine repeat.

6. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence
encodes a protein wherein the α subunit or β subunit or both are modified by the insertion
of a complete or partial CTP unit or variant thereof into a noncritical region thereof and/or
wherein said linker includes a complete or partial CTP unit or variant thereof, wherein
CTP refers to the amino acid sequence found at the carboxy terminus of human chorionic
gonadotropin β subunit which extends from amino acid residues 112-118 to residue 145,
or a portion thereof or a variant thereof.

7. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence 
encodes a protein wherein said variants contain 1-5 conservative amino acid substitutions 
as referred to the native forms or are truncated forms of said sequences or both. 

8. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence
encodes a protein wherein the α and β subunits are human α and β subunits or their
variants.

9. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence
encodes a protein selected from the group consisting of formulas 1-10, 1a-10a,
and 1b-10b of Table 1.

10. The DNA or RNA molecule of claim 1 wherein the nucleotide sequence
encodes a protein wherein β is the β subunit of TSH or a variant thereof.

11. An expression system for the production of an agonist or antagonist of LH,
FSH, TSH or CG which comprises the nucleotide sequence of claim 1 operably linked to
control sequences which effect its expression in a compatible host cell.



12. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein n is zero.

13. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein n is 1 and the linker is a complete CTP unit consisting of amino
acid residues 112-118 to 145 of human chorionic gonadotropin β subunit.

14. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein n is 1 and the linker is a peptide containing 1-16 amino acids.

15. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein the linker is a glycine/serine repeat.

16. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein the α subunit or β subunit or both are modified by the insertion
of a complete or partial CTP unit or variant thereof into a noncritical region thereof and/or
wherein said linker includes a complete or partial CTP unit or variant thereof, wherein
CTP refers to the amino acid sequence found at the carboxy terminus of human chorionic
gonadotropin β subunit which extends from amino acid residues 112-118 to residue 145,
or a portion thereof or a variant thereof.

17. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein said variants contain 1-5 conservative amino acid substitutions
as referred to the native forms or are truncated forms of said sequences or both.

18. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein the α and β subunits are human α and β subunits or their
variants.

19. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein selected from the group consisting of formulas 1-10, 1a-10a,
and 1b-10b of Table 1.

20. The expression system of claim 11 wherein the nucleotide sequence
encodes a protein wherein β is the β subunit of TSH or a variant thereof.

21. Recombinant host cells modified to contain the expression system of claim
11.

22. Recombinant host cells modified to contain the expression system of claim
12.

23. Recombinant host cells modified to contain the expression system of claim
13.

24. Recombinant host cells modified to contain the expression system of claim
14.

25. Recombinant host cells modified to contain the expression system of claim
15.

26. Recombinant host cells modified to contain the expression system of claim
16.

27. Recombinant host cells modified to contain the expression system of claim
17.

28. Recombinant host cells modified to contain the expression system of claim
18.

29. Recombinant host cells modified to contain the expression system of claim
19.

30. Recombinant host cells modified to contain the expression system of claim
20.

31. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 21 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

32. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 22 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

33. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 23 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

34. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 24 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

35. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 25 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

36. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 26 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

37. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 27 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

38. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 28 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

39. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 29 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.

40. A method to produce an agonist or antagonist of LH, FSH, TSH or CG
which method comprises culturing the cells of claim 30 under conditions wherein said
agonist or antagonist is produced and

optionally recovering said agonist or antagonist from the cell culture.
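
The formula recited in claim 1 amounts to joining the two subunit sequences in either order, with an optional peptide linker of 1-100 amino acids between them (n = 1) or with no linker at all (n = 0). The sketch below illustrates only that arrangement. The function name `single_chain`, the placeholder strings standing in for the subunits, and the eight-residue glycine/serine linker are all hypothetical and are not sequences disclosed in this application.

```python
def single_chain(beta: str, alpha: str, linker: str = "",
                 beta_first: bool = True) -> str:
    """Join two subunit sequences as beta-(linker)n-alpha or
    alpha-(linker)n-beta. An empty linker corresponds to n = 0."""
    if len(linker) > 100:
        raise ValueError("linker must contain at most 100 amino acids")
    first, second = (beta, alpha) if beta_first else (alpha, beta)
    return first + linker + second


# Placeholder strings, NOT real subunit sequences.
BETA = "BBBB"
ALPHA = "AAAA"
# Hypothetical glycine/serine repeat linker of 8 residues.
GS_LINKER = "GS" * 4

direct = single_chain(BETA, ALPHA)                        # n = 0
linked = single_chain(BETA, ALPHA, GS_LINKER)             # n = 1, beta first
alpha_first = single_chain(BETA, ALPHA, GS_LINKER,
                           beta_first=False)              # n = 1, alpha first
```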

Abstract

Single-chain forms of the glycoprotein hormone quartet, at least some members of
which are found in most vertebrates, are disclosed. The α and β subunits of the wild-type
heterodimers or their variants or their fragments are covalently linked, optionally through
a linker moiety. Some of the single-chain forms are agonists and others antagonists of the
glycoprotein hormone activity.

Figure 1